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Abstract 

In physics, Feynman diagrams are used to reason about quantum processes. In the 1980s, it became clear 
that underlying these diagrams is a powerful analogy between quantum physics and topology. Namely, a lin¬ 
ear operator behaves very much like a ‘cobordism’: a manifold representing spacetime, going between two 
manifolds representing space. This led to a burst of work on topological quantum field theory and ‘quan¬ 
tum topology’. But this was just the beginning: similar diagrams can be used to reason about logic, where 
they represent proofs, and computation, where they represent programs. With the rise of interest in quantum 
cryptography and quantum computation, it became clear that there is extensive network of analogies between 
physics, topology, logic and computation. In this expository paper, we make some of these analogies pre¬ 
cise using the concept of ‘closed symmetric monoidal category’. We assume no prior knowledge of category 
theory, proof theory or computer science. 


1 Introduction 

Category theory is a very general formalism, but there is a certain special way that physicists use categories 
which turns out to have close analogues in topology, logic and computation. A category has objects and mor- 
phisms, which represent things and ways to go between things. In physics, the objects are often physical systems, 
and the morphisms are processes turning a state of one physical system into a state of another system — per¬ 
haps the same one. In quantum physics we often formalize this by taking Hilbert spaces as objects, and linear 
operators as morphisms. 
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Sometime around 1949, Feynman m realized that in quantum field theory it is useful to draw linear oper¬ 
ators as diagrams: 



This lets us reason with them pictorially. We can warp a picture without changing the operator it stands for: all 
that matters is the topology, not the geometry. In the 1970s, Penrose realized that generalizations of Feynman 
diagrams arise throughout quantum theory, and might even lead to revisions in our understanding of space- 
time (ZSi- In the 1980s, it became clear that underlying these diagrams is a powerful analogy between quan¬ 
tum physics and topology! Namely, a linear operator behaves very much like a ‘cobordism’ — that is, an 
n-dimensional manifold going between manifolds of one dimension less: 



String theory exploits this analogy by replacing the Feynman diagrams of ordinary quantum field theory with 
2-dimensional cobordisms, which represent the worldsheets traced out by strings with the passage of time. The 
analogy between operators and cobordisms is also important in loop quantum gravity and — most of all — the 
more purely mathematical discipline of ‘topological quantum field theory’. 

Meanwhile, quite separately, logicians had begun using categories where the objects represent propositions 
and the morphisms represent proofs. The idea is that a proof is a process going from one proposition (the 
hypothesis) to another (the conclusion). Later, computer scientists started using categories where the objects 
represent data types and the morphisms represent programs. They also started using ‘flow charts’ to describe 
programs. Abstractly, these are very much like Feynman diagrams! 

The logicians and computer scientists were never very far from each other. Indeed, the ‘Curry-Howard 
correspondence’ relating proofs to programs has been well-known at least since the early 1970s, with roots 
stretching back earlier E3E21. But, it is only in the 1990s that the logicians and computer scientists bumped 
into the physicists and topologists. One reason is the rise of interest in quantum cryptography and quantum 
computation l28l . With this, people began to think of quantum processes as forms of information processing, 
and apply ideas from computer science. It was then realized that the loose analogy between flow charts and 
Feynman diagrams could be made more precise and powerful with the aid of category theory 0- 

By now there is an extensive network of interlocking analogies between physics, topology, logic and com¬ 
puter science. They suggest that research in the area of common overlap is actually trying to build a new science: 


a general science of systems and processes. Building this science will be very difficult. There are good reasons 
for this, but also bad ones. One bad reason is that different fields use different terminology and notation. 

The original Rosetta Stone, created in 196 BC, contains versions of the same text in three languages: de¬ 
motic Egyptian, hieroglyphic script and classical Greek. Its rediscovery by Napoleon’s soldiers let modem 
Egyptologists decipher the hieroglyphs. Eventually this led to a vast increase in our understanding of Egyptian 
culture. 

At present, the deductive systems in mathematical logic look like hieroglyphs to most physicists. Similarly, 
quantum field theory is Greek to most computer scientists, and so on. So, there is a need for a new Rosetta Stone 
to aid researchers attempting to translate between fields. TableQ] shows our guess as to what this Rosetta Stone 
might look like. 


Category Theory 

Physics 

Topology 

Logic 

Computation 

object 

system 

manifold 

proposition 

data type 

morphism 

process 

cobordism 

proof 

program 


Table 1: The Rosetta Stone (pocket version) 


The rest of this paper expands on this table by comparing how categories are used in physics, topology, logic, 
and computation. Unfortunately, these different fields focus on slightly different kinds of categories. Though 
most physicists don’t know it, quantum physics has long made use of ‘compact symmetric monoidal categories’. 
Knot theory uses ‘compact braided monoidal categories’, which are slightly more general. However, it became 
clear in the 1990’s that these more general gadgets are useful in physics too. Logic and computer science used 
to focus on ‘cartesian closed categories’ — where ‘cartesian’ can be seen, roughly, as an antonym of ‘quantum’. 
However, thanks to work on linear logic and quantum computation, some logicians and computer scientists have 
dropped their insistence on cartesianness: now they study more general sorts of ‘closed symmetric monoidal 
categories’. 

In Section |2] we explain these concepts, how they illuminate the analogy between physics and topology, 
and how to work with them using string diagrams. We assume no prior knowledge of category theory, only 
a willingness to learn some. In Section [3] we explain how closed symmetric monoidal categories correspond 
to a small fragment of ordinary propositional logic, which also happens to be a fragment of Girard’s ‘linear 
logic’ f4fl . In Section |T] we explain how closed symmetric monoidal categories correspond to a simple model 
of computation. Each of these sections starts with some background material. In Section [5] we conclude by 
presenting a larger version of the Rosetta Stone. 

Our treatment of all four subjects — physics, topology, logic and computation — is bound to seem sketchy, 
narrowly focused and idiosyncratic to practitioners of these subjects. Our excuse is that we wish to empha¬ 
size certain analogies while saying no more than absolutely necessary. To make up for this, we include many 
references for those who wish to dig deeper. 
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2 The Analogy Between Physics and Topology 

2.1 Background 

Currently our best theories of physics are general relativity and the Standard Model of particle physics. The first 
describes gravity without taking quantum theory into account; the second describes all the other forces taking 
quantum theory into account, but ignores gravity. So, our world-view is deeply schizophrenic. The field where 
physicists struggle to solve this problem is called quantum gravity, since it is widely believed that the solution 
requires treating gravity in a way that takes quantum theory into account. 


Physics 

Topology 

Hilbert space 

(n - l)-dimensional manifold 

(system) 

(space) 

operator between 

cobordism between 

Hilbert spaces 

(n - l)-dimensional manifolds 

(process) 

(spacetime) 

composition of operators 

composition of cobordisms 

identity operator 

identity cobordism 


Table 2: Analogy between physics and topology 


Nobody is sure how to do this, but there is a striking similarity between two of the main approaches: string 
theory and loop quantum gravity. Both rely on the analogy between physics and topology shown in Table |5| On 
the left we have a basic ingredient of quantum theory: the category Hilb whose objects are Hilbert spaces, used 
to describe physical systems, and whose morphisms are linear operators, used to describe physical processes. 
On the right we have a basic structure in differential topology: the category nCob. Here the objects are (n - 1)- 
dimensional manifolds, used to describe space, and whose morphisms are /(-dimensional cobordisms, used to 
describe spacetime. 

As we shall see, Hilb and nCob share many structural features. Moreover, both are very different from 
the more familiar category Set, whose objects are sets and whose morphisms are functions. Elsewhere we 
have argued at great length that this is important for better understanding quantum gravity 03 and even the 
foundations of quantum theory m. The idea is that if Hilb is more like nCob than Set, maybe we should stop 
thinking of a quantum process as a function from one set of states to another. Instead, maybe we should think 
of it as resembling a ‘spacetime’ going between spaces of dimension one less. 

This idea sounds strange, but the simplest example is something very practical, used by physicists every day: 
a Feynman diagram. This is a 1-dimensional graph going between 0-dimensional collections of points, with 
edges and vertices labelled in certain ways. Feynman diagrams are topological entities, but they describe linear 
operators. String theory uses 2-dimensional cobordisms equipped with extra structure — string worldsheets — 
to do a similar job. Loop quantum gravity uses 2d generalizations of Feynman diagrams called ‘spin foams’ 
(9). Topological quantum field theory uses higher-dimensional cobordisms EQ. In each case, processes are 
described by morphisms in a special sort of category: a ‘compact symmetric monoidal category’. 

In what follows, we shall not dwell on puzzles from quantum theory or quantum gravity. Instead we take a 
different tack, simply explaining some basic concepts from category theory and showing how Set, Hilb, nCob 
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and categories of tangles give examples. A recurring theme, however, is that Set is very different from the other 
examples. 

To help the reader safely navigate the sea of jargon, here is a chart of the concepts we shall explain in this 
section: 



The category Set is cartesian closed, while Hilb and nCob are compact symmetric monoidal. 

2.2 Categories 

Category theory was born around 1945, with Eilenberg and Mac Lane BTl defining ‘categories’, ‘functors’ 
between categories, and ‘natural transformations’ between functors. By now there are many introductions to the 
subject mmm , including some available for free online 12011461 . Nonetheless, we begin at the beginning: 

Definition 1 A category C consists of: 

• a collection of objects, where ifX is an object ofC we write X e C, and 

• for every pair of objects (A, Yj, a set hom(X, Y) of morphisms from X to Y. We call this set horn (A, Y ) a 
homset. Iff e hom(X, 7), then we write f: X —> Y. 

such that: 


• for every object X there is an identity morphism lx: X —» X\ 


























• morphisms are composable: given f:X —> Y andg:Y —» Z, there is a composite morphism gf:X —» Z; 
sometimes also written g ° /. 

• an identity morphism is both a left and a right unit for composition: if f: X —» F, ?/?en /lx = / = ly/; 
and 

• composition is associative: (hg)f = h(gf) whenever either side is well-defined. 

Definition 2 We say a morphism f:X—> Y is an isomorphism if it has an inverse — that is, there exists another 
morphism g:Y —> X such that gf = \ x and fg= ly. 

A category is the simplest framework where we can talk about systems (objects) and processes (morphisms). 
To visualize these, we can use ‘Feynman diagrams’ of a very primitive sort. In applications to linear algebra, 
these diagrams are often called ‘spin networks’, but category theorists call them ‘string diagrams’, and that is 
the term we will use. The term ‘string’ here has little to do with string theory: instead, the idea is that objects of 
our category label ‘strings’ or ‘wires’: 



and morphisms f:X —» Y label ‘black boxes’ with an input wire of type X and an output wire of type Y: 


We compose two morphisms by connecting the output of one black box to the input of the next. So, the 
composite of /: X —» Y and g:Y —» Z looks like this: 


Associativity of composition is then implicit: 



is our notation for both h(gf) and (hg)f. Similarly, if we draw the identity morphism 1*: X —s> X as a piece of 
wire of type X: 



then the left and right unit laws are also implicit. 

There are countless examples of categories, but we will focus on four: 

• Set: the category where objects are sets. 

• Hilb: the category where objects are finite-dimensional Hilbert spaces. 

• nCob: the category where morphisms are n-dimensional cobordisms. 

• Tang^: the category where morphisms are /c-codimensional tangles. 

As we shall see, all four are closed symmetric monoidal categories, at least when k is big enough. However, the 
most familiar of the lot, namely Set, is the odd man out: it is ‘cartesian’. 

Traditionally, mathematics has been founded on the category Set, where the objects are sets and the mor¬ 
phisms are functions. So, when we study systems and processes in physics, it is tempting to specify a system by 
giving its set of states, and a process by giving a function from states of one system to states of another. 

However, in quantum physics we do something subtly different: we use categories where objects are Hilbert 
spaces and morphisms are bounded linear operators. We specify a system by giving a Hilbert space, but this 
Hilbert space is not really the set of states of the system: a state is actually a ray in Hilbert space. Similarly, a 
bounded linear operator is not precisely a function from states of one system to states of another. 

In the day-to-day practice of quantum physics, what really matters is not sets of states and functions between 
them, but Hilbert space and operators. One of the virtues of category theory is that it frees us from the ‘Set¬ 
centric’ view of traditional mathematics and lets us view quantum physics on its own terms. As we shall see, 
this sheds new light on the quandaries that have always plagued our understanding of the quantum realm El- 
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To avoid technical issues that would take us far afield, we will take Hilb to be the category where objects 
axe finite-dimensional Hilbert spaces and morphisms are linear operators (automatically bounded in this case). 
Finite-dimensional Hilbert spaces suffice for some purposes; infinite-dimensional ones are often important, but 
treating them correctly would require some significant extensions of the ideas we want to explain here. 

In physics we also use categories where the objects represent choices of space, and the morphisms represent 
choices of spacetime. The simplest is nCob, where the objects are (n — l)-dimensional manifolds, and the 
morphisms are n-dimensional cobordisms. Glossing over some subtleties that a careful treatment would discuss 
l8H . a cobordism /: X —> Y is an n-dimensional manifold whose boundary is the disjoint union of the (n - 1)- 
dimensional manifolds X and Y. Here are a couple of cobordisms in the case n = 2: 



We compose them by gluing the ‘output’ of one to the ‘input’ of the other. So, in the above example gf: X —> Z 
looks like this: 

X 



Another kind of category important in physics has objects representing collections of particles, and mor¬ 
phisms representing their worldlines and interactions. Feynman diagrams are the classic example, but in these 
diagrams the ‘edges’ are not taken literally as particle trajectories. An example with closer ties to topology is 
Tang,. 

Very roughly speaking, an object in Tang, is a collection of points in a ^-dimensional cube, while a morphism 
is a ‘tangle’: a collection of arcs and circles smoothly embedded in a (k + l)-dimensional cube, such that the 
circles lie in the interior of the cube, while the arcs touch the boundary of the cube only at its top and bottom, and 
only at their endpoints. A bit more precisely, tangles are ‘isotopy classes’ of such embedded arcs and circles: 
this equivalence relation means that only the topology of the tangle matters, not its geometry. We compose 
tangles by attaching one cube to another top to bottom. 

More precise definitions can be found in many sources, at least for k = 2, which gives tangles in a 3- 
dimensional cube l42ll58M8Tll89ll97lll01l . But since a picture is worth a thousand words, here is a picture of a 





rphisminTang 2 : 


X 



Note that we can think of a morphism in Tang^. as a 1-dimensional cobordism embedded in a k-dimensional 
cube. This is why Tang^ and nCob behave similarly in some respects. 

Here are two composable morphisms in Tang,: 



and here is their composite: 



Since only the tangle’s topology matters, we are free to squash this rectangle into a square if we want, but we 
do not need to. 

It is often useful to consider tangles that are decorated in various ways. For example, in an ‘oriented’ tangle, 
each arc and circle is equipped with an orientation. We can indicate this by drawing a little arrow on each 
curve in the tangle. In applications to physics, these curves represent worldlines of particles, and the arrows 
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say whether each particle is going forwards or backwards in time, following Feynman’s idea that antiparticles 
are particles going backwards in time. We can also consider ‘framed’ tangles. Here each curve is replaced by 
a ‘ribbon’. In applications to physics, this keeps track of how each particle twists. This is especially important 
for fermions, where a 2n twist acts nontrivially. Mathematically, the best-behaved tangles are both framed and 
oriented II31 1831 , and these are what we should use to define Tang^,. The category nCob also has a framed 
oriented version. However, these details will barely matter in what is to come. 

It is difficult to do much with categories without discussing the maps between them. A map between cate¬ 
gories is called a ‘functor’: 

Definition 3 A functor F: C —» Dfrom a category C to a category D is a map sending: 

• any object X e C to an object F(X) e D, 

• any morphism f: X —> Y in C to a morphism F(f ): F(X) —> F( Y) in D, 
in such a way that: 

• F preserves identities: for any object X € C, F(l x ) = 1 

• F preserves composition: for any pair ofmorphisms f:X -> Y, g:Y —> Z in C, F(gf) = F{g)F(J). 

In the sections to come, we will see that functors and natural transformations are useful for putting extra 
structure on categories. Here is a rather different use for functors: we can think of a functor F: C —> D as giving 
a picture, or ‘representation’, of C in D. The idea is that F can map objects and morphisms of some ‘abstract’ 
category C to objects and morphisms of a more ‘concrete’ category D. 

For example, consider an abstract group G. This is the same as a category with one object and with all 
morphisms invertible. The object is uninteresting, so we can just call it •, but the morphisms are the elements 
of G, and we compose them by multiplying them. From this perspective, a representation of G on a finite¬ 
dimensional Hilbert space is the same as a functor F:G —» Hilb. Similarly, an action of G on a set is the same 
as a functor F: G —> Set. Both notions are ways of making an abstract group more concrete. 

Ever since Lawvere’s 1963 thesis on functorial semantics J69), the idea of functors as representations has 
become pervasive. However, the terminology varies from field to field. Following Lawvere, logicians often call 
the category C a ‘theory’, and call the functor F: C —> D a ‘model’ of this theory. Other mathematicians might 
call F an ‘algebra’ of the theory. In this work, the default choice of D is usually the category Set. 

In physics, it is the functor F: C —* D that is called the ‘theory’. Here the default choice of D is either the 
category we are calling Hilb or a similar category of infinite-dimensional Hilbert spaces. For example, both 
‘conformal field theories’ f85l and topological quantum held theories 0 can be seen as functors of this sort. 

If we think of functors as models, natural transformations are maps between models: 

Definition 4 Given two functors F,F’:C —> D, a natural transformation a: F => F' assigns to every object X 
in C a morphism ax' F(X) —> F'(X) such that for any morphism f:X—> Y in C, the equation ay F(f) = F' (/) a x 


10 


holds in D. In other words, this square commutes: 


(Going across and then down equals going down and then across.) 


Definition 5 A natural isomorphism between functors F,F':C —> D is a natural transformation a: F => F' 
such that ax is an isomorphism for every X e C. 


For example, suppose F,F':G —> Hilb are functors where G is a group, thought of as a category with one 
object, say •. Then, as already mentioned, F and F' are secretly just representations of G on the Hilbert spaces 
F(») and F'(»). A natural transformation a: F => F' is then the same as an intertwining operator from one 
representation to another: that is, a linear operator 

A: F(.) -> F'(.) 


satisfying 


for all group elements g. 


AF(g) = F'(g)A 


2.3 Monoidal Categories 

In physics, it is often useful to think of two systems sitting side by side as forming a single system. In topology, 
the disjoint union of two manifolds is again a manifold in its own right. In logic, the conjunction of two statement 
is again a statement. In programming we can combine two data types into a single ‘product type’. The concept 
of ‘monoidal category’ unifies all these examples in a single framework. 

A monoidal category C has a functor ®:CxC^ C that takes two objects X and Y and puts them together 
to give a new object X ® Y. To make this precise, we need the cartesian product of categories: 

Definition 6 The cartesian product CxC' of categories C and C' is the category where: 

• an object is a pair ( X , X') consisting of an object X &C and an object X' e C'; 

• a morphism from (X,X') to (T, T') is a pair (/,/') consisting of a morphism f:X—> Y and a morphism 
f':X' -> Y’; 

• composition is done componentwise: (g, g')(f, /') = (gf, g'f ): 

• identity morphisms are defined componentwise: 1 (x,x'> = (lx, lxO- 
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Mac Lane ED defined monoidal categories in 1963. The subtlety of the definition lies in the fact that 
{X®Y)®Z and X ® (T <g> Z) are not usually equal. Instead, we should specify an isomorphism between them, 
called the ‘associator’. Similarly, while a monoidal category has a ‘unit object’ I, it is not usually true that I <g> X 
and X® I equal X. Instead, we should specify isomorphisms /<g> X = X and X®I = X. To be manageable, all 
these isomorphisms must then satisfy certain equations: 

Definition 7 A monoidal category consists of: 

• a category C, 

• a tensor product functor ®:CxC->C, 

• a unit object I e C, 

• a natural isomorphism called the associator, assigning to each triple of objects X,Y,Z e C an isomor¬ 
phism 

a x ,Yz : {X®Y)®Z^>X®(Y®Z), 

• natural isomorphisms called the left and right unitors, assigning to each object X e C isomorphisms 

l x :I®X^>X 

r x :X®I^>X, 


such that: 

• for all X,Y e C the triangle equation holds: 

( X®I)®Y - — -^ X® (I® Yj 



12 



for all W, X,Y,Z € C, the pentagon equation holds: 


(( W®X)®Y)®Z 



W®(Z®(F®Z)) 


When we have a tensor product of four objects, there are five ways to parenthesize it, and at first glance the 
associator lets us build two isomorphisms from W <g)(X<8>(Y<8>Z)) to ((W®X)®Y)®Z. But, the pentagon equation 
says these isomorphisms are equal. When we have tensor products of even more objects there are even more 
ways to parenthesize them, and even more isomorphisms between them built from the associator. However, 
Mac Lane showed that the pentagon identity implies these isomorphisms are all the same. Similarly, if we also 
assume the triangle equation, all isomorphisms with the same source and target built from the associator, left 
and right unit laws are equal. 

In a monoidal category we can do processes in ‘parallel’ as well as in ‘series’. Doing processes in series is 
just composition of morphisms, which works in any category. But in a monoidal category we can also tensor 
morph isms /: X —> Y and /': X' —> Y' and obtain a ‘parallel process’ f®f':X®X' —> Y®Y'. We can draw this 
in various ways: 



More generally, we can draw any morphism 

/:x 1 ®---®x„^y 1 ®---®y „ 1 
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black box with n input wires and m output wires: 


Xi 



X 3 


We draw the unit object / as a blank space. So, for example, we draw a morphism /: / —> X as follows: 


By composing and tensoring morphisms, we can build up elaborate pictures resembling Feynman diagrams: 



Y\ ' 1 Y 2 Y 3 Y 4 

The laws governing a monoidal category allow us to neglect associators and unitors when drawing such pictures, 
without getting in trouble. The reason is that Mac Lane’s Coherence Theorem says any monoidal category is 
‘equivalent’, in a suitable sense, to one where all associators and unitors are identity morphisms CD- 

We can also deform the picture in a wide variety of ways without changing the morphism it describes. For 
example, the above morphism equals this one: 
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Everyone who uses string diagrams for calculations in monoidal categories starts by worrying about the rules of 
the game: precisely how can we deform these pictures without changing the morphisms they describe? Instead 
of stating the rules precisely — which gets a bit technical — we urge you to explore for yourself what is allowed 
and what is not. For example, show that we can slide black boxes up and down like this: 



For a formal treatment of the rules governing string diagrams, try the original papers by Joyal and Street m 
and the book by Yetter nun . 

Now let us turn to examples. Here it is crucial to realize that the same category can often be equipped with 
different tensor products, resulting in different monoidal categories: 

• There is a way to make Set into a monoidal category where X <g> Y is the cartesian product X x Y and 
the unit object is any one-element set. Note that this tensor product is not strictly associative, since 
(x, (y,z)) + ((x,y),z), but there’s a natural isomorphism (X x Y) x Z = X x (Y x Z), and this is our 
associator. Similar considerations give the left and right unitors. In this monoidal category, the tensor 
product of f:X —* Y and f':X' —» Y' is the function 

fxf -.XxX' -> YxY' 

(x,x'j » (fix), fix')). 

There is also a way to make Set into a monoidal category where X ® Y is the disjoint union of X and 
T, which we shall denote by X + Y. Here the unit object is the empty set. Again, as indeed with all 
these examples, the associative law and left/right unit laws hold only up to natural isomorphism. In this 
monoidal category, the tensor product of /: X —» Y and f : X' —» Y' is the function 

/ + /': X + X' Y + Y' 

I f(x) if x 6 X, 

X ^ \f(x) if x 6 A'. 

However, in what follows, when we speak of Set as a monoidal category, we always use the cartesian 
product! 

• There is a way to make Hilb into a monoidal category with the usual tensor product of Hilbert spaces: 
C"®C m = C"" ! . In this case the unit object I can be taken to be a 1-dimensional Hilbert space, for example 
C. 

There is also a way to make Hilb into a monoidal category where the tensor product is the direct sum: 
C' ! © C m = C n+m . In this case the unit object is the zero-dimensional Hilbert space, {0}. 

However, in what follows, when we speak o/ Hilb as a monoidal category, we always use the usual tensor 
product! 
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The tensor product of objects and morphisms in nCob is given by disjoint union. For example, the tensor 
product of these two morphisms: 



their tensor product looks like this: 


Z®Z' 

/»/' 

y® r 

The example of Set with its cartesian product is different from our other three main examples, because the 
cartesian product of sets Z x Z' comes equipped with functions called ‘projections’ to the sets X and X': 

X ^— Z x Z' — P -+- X' 
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Our other main examples lack this feature — though Hilb made into a monoidal category using ® has projections. 
Also, every set has a unique function to the one-element set: 

UX^I. 

Again, our other main examples lack this feature, though Hilb made into a monoidal category using © has it. A 
fascinating feature of quantum mechanics is that we make Hilb into a monoidal category using <g> instead of ©, 
even though the latter approach would lead to a category more like Set. 

We can isolate the special features of the cartesian product of sets and its projections, obtaining a definition 
that applies to any category: 


Definition 8 Given objects X and X' in some category, we say an object XxX’ equipped with morphisms 
X XxX’ — P —>~ X' 

is a cartesian product (or simply product) of X and X' if for any object Q and morphisms 


y 


X 


Q 


y 


X' 


there exists a unique morphism g: Q —» X x X' making the following diagram commute: 



X +— p — X x X' —7- X' 


(That is, f = pg and f = p'g.) We say a category has binary products if every pair of objects has a product. 


The product may not exist, and it may not be unique, but when it exists it is unique up to a canonical iso¬ 
morphism. This justifies our speaking of ‘the’ product of objects X and Y when it exists, and denoting it as 
XxY. 

The definition of cartesian product, while absolutely fundamental, is a bit scary at first sight. To illustrate its 
power, let us do something with it: combine two morphisms f:X —» Y and f'.X' —> Y' into a single morphism 

/x/:XxX'-> YxY'. 

The definition of cartesian product says how to build a morphism of this sort out of a pair of morphisms: namely, 
morphisms from X x X' to Y and Y'. If we take these to be fp and f'p' j we obtain / x /': 


XxX' 



Next, let us isolate the special features of the one-element set: 
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Definition 9 An object 1 in a category C is terminal if for any object Q e C there exists a unique morphism 
from Q to 1, which we denote as \q\Q-* 1. 

Again, a terminal object may not exist and may not be unique, but it is unique up to a canonical isomorphism. 
This is why we can speak of ‘the’ terminal object of a category, and denote it by a specific symbol, 1. 

We have introduced the concept of binary products. One can also talk about n-ary products for other values 
of n, but a category with binary products has n-ary products for all n > 1, since we can construct these as iterated 
binary products. The case n = 1 is trivial, since the product of one object is just that object itself (up to canonical 
isomorphism). The remaining case is n = 0. The zero-ary product of objects, if it exists, is just the terminal 
object. So, we make the following definition: 

Definition 10 A category has finite products if it has binary products and a terminal object. 

A category with finite products can always be made into a monoidal category by choosing a specific product 
X x Y to be the tensor product X<g>Y, and choosing a specific terminal object to be the unit object. It takes a bit 
of work to show this! A monoidal category of this form is called cartesian. 

In a cartesian category, we can ‘duplicate and delete information’. In general, the definition of cartesian 
products gives a way to take two morphisms f) : Q —> X and jf. Q —> Y and combine them into a single 
morphism from Q to X x Y. If we take Q = X = Y and take f\ and fa to be the identity, we obtain the diagonal 
or duplication morphism: 

AjA^IxI 

In the category Set one can check that this maps any element x e X to the pair (x, x). In general, we can draw 
the diagonal as follows: 



Similarly, we call the unique map to the terminal object 

] -x'X —> 1 


the deletion morphism, and draw it as follows: 


Note that we draw the unit object as an empty space. 
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A fundamental fact about cartesian categories is that duplicating something and then deleting either copy is 
the same as doing nothing at all! In string diagrams, this says: 



We leave the proof as an exercise for the reader. 

Many of the puzzling features of quantum theory come from the noncartesianness of the usual tensor product 
in Hilb. For example, in a cartesian category, every morphism 



is actually of the form 



In the case of Set, this says that every point of the set X x X' comes from a point of X and a point of X'. In 
physics, this would say that every state g of the combined system X <g> X' is built by combining states of the 
systems X and X'. Bell’s theorem fl9i says that is not true in quantum theory. The reason is that quantum theory 
uses the noncartesian monoidal category Hilb! 

Also, in quantum theory we cannot freely duplicate or delete information. Wootters and Zurek 111001 proved 
a precise theorem to this effect, focused on duplication: the ‘no-cloning theorem’. One can also prove a ‘no¬ 
deletion theorem’. Again, these results rely on the noncartesian tensor product in Hilb. 

2.4 Braided Monoidal Categories 

In physics, there is often a process that lets us ‘switch’ two systems by moving them around each other. In 
topology, there is a tangle that describes the process of switching two points: 
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In logic, we can switch the order of two statements in a conjunction: the statement ‘X and Y’ is isomorphic to 
‘Y and X’. In computation, there is a simple program that switches the order of two pieces of data. A monoidal 
category in which we can do this sort of thing is called ‘braided’: 

Definition 11 A braided monoidal category consists of: 

• a monoidal category C, 

• a natural isomorphism called the braiding that assigns to every pair of objects X,Y e C an isomorphism 

b XY :X®Y Y®X , 


such that the hexagon equations hold: 

X®{Y®Z) (.X®Y)®Z (Y®X)®Z 


{Y®Z)®X —— Y®{Z®X) Y®(X®Z) 

a~^ zx 1 Y®b X jz 

(. X®Y)®Z —^ X®(Y®Z ) X®(Z®Y ) 


Z®(X®Y) - ( Z®X)®Y - ( X®Z)®Y 

a zx,y bxz®tr 

The first hexagon equation says that switching the object X past Y® Z all at once is the same as switching it past 
Y and then past Z (with some associators thrown in to move the parentheses). The second one is similar: it says 
switching X ® Y past Z all at once is the same as doing it in two steps. 

In string diagrams, we draw the braiding bx,y'. X ®Y —> Y ® X like this: 

X \/ 

A 
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We draw its inverse b XY like this: 



This is a nice notation, because it makes the equations saying that bxj and b XY are inverses ‘topologically true’: 
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For practice, we urge you to prove the following equations: 



If you get stuck, here are some hints. The first equation follows from the naturality of the braiding. The second 
is called the Yang-Baxter equation and follows from a combination of naturality and the hexagon equations 

EE 

Next, here are some examples. There can be many different ways to give a monoidal category a braiding, or 
none. However, most of our favorite examples come with well-known ‘standard’ braidings: 

• Any cartesian category automatically becomes braided, and in Set with its cartesian product, this standard 
braiding is given by: 

b X j :XxY -> YxX 
(x, y) h-> (y,x). 

• In Hilb with its usual tensor product, the standard braiding is given by: 

b X j :X®Y -> Y®X 
x®y h-> y ® x. 

• The monoidal category nCob has a standard braiding where bx,Y is diffeomorphic to the disjoint union of 
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cylinders Xx [0,1] and Y X [0,1], For 2Cob this braiding looks as follows when X and Y are circles: 


X®Y 



Y®X 


The monoidal category Tang^ has a standard braiding when k > 2. For k = 2 this looks as follows when 
X and Y are each a single point: 


X®Y 



Y®X 


The example of Tang^ illustrates an important pattern. Tang 0 is just a category, because in 0-dimensional 
space we can only do processes in ‘series’: that is, compose morphisms. Tang, is a monoidal category, because 
in 1-dimensional space we can also do processes in ‘parallel’: that is, tensor morphisms. Tang 2 is a braided 
monoidal category, because in 2-dimensional space there is room to move one object around another. Next we 
shall see what happens when space has 3 or more dimensions! 

2.5 Symmetric Monoidal Categories 

Sometimes switching two objects and switching them again is the same as doing nothing at all. Indeed, this 
situation is very familiar. So, the first braided monoidal categories to be discovered were ‘symmetric’ ones (71 1: 

Definition 12 A symmetric monoidal category is a braided monoidal category where the braiding satisfies 


bx,Y = b Y ' x . 

So, in a symmetric monoidal category, 



Y 
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or equivalently: 


X \ j Y X, jY 

x-x 

Any cartesian category automatically becomes a symmetric monoidal category, so Set is symmetric. It is 
also easy to check that Hilb, nCob are symmetric monoidal categories. So is Tang^ for k > 3. 

Interestingly, Tang^ ‘stabilizes’ at k = 3: increasing the value of k beyond this value merely gives a category 
equivalent to Tang 3 . The reason is that we can already untie all knots in 4-dimensional space; adding extra 
dimensions has no real effect. In fact, Tang fc for k > 3 is equivalent to ICob. This is part of a conjectured larger 
pattern called the ‘Periodic Table’ of n-categories m A piece of this is shown in Tabled 

An n-category has not only morphisms going between objects, but 2-morphisms going between morphisms, 
3-morphisms going between 2-morphisms and so on up to n-morphisms. In topology we can use ^-categories to 
describe tangled higher-dimensional surfaces ITRI . and in physics we can use them to describe not just particles 
but also strings and higher-dimensional membranes mm- The Rosetta Stone we are describing concerns only 
the n = 1 column of the Periodic Table. So, it is probably just a fragment of a larger, still buried n-categorical 
Rosetta Stone. 



n = 0 

n = 1 

n = 2 

k = 0 

sets 

categories 

2-categories 

k= 1 

monoids 

monoidal 

categories 

monoidal 

2-categories 

k = 2 

commutative 

monoids 

braided 

monoidal 

categories 

braided 

monoidal 

2-categories 

k = 3 


symmetric 

monoidal 

categories 

sylleptic 

monoidal 

2-categories 

k = 4 



symmetric 

monoidal 

2-categories 

k = 5 




k = 6 





Table 3: The Periodic Table: conjectured descriptions of (n + ^-categories with only one y-morphism for j < k. 
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2.6 Closed Categories 


In quantum mechanics, one can encode a linear operator f\X —» Y into a quantum state using a technique called 
‘gate teleportation’ B7I . In topology, there is a way to take any tangle /: X — * Y and bend the input back around 
to make it part of the output. In logic, we can take a proof that goes from some assumption X to some conclusion 
Y and turn it into a proof that goes from no assumptions to the conclusion ‘X implies Y’ . In computer science, 
we can take any program that takes input of type X and produces output of type Y, and think of it as a piece of 
data of a new type: a ‘function type’. The underlying concept that unifies all these examples is the concept of a 
‘closed category’. 

Given objects X and Y in any category C, there is a set of morphisms from X to Y, denoted hom(X, Y). In 
a closed category there is also an object of morphisms from X to Y, which we denote by X -o Y. (Many other 
notations are also used.) In this situation we speak of an ‘internal hom’, since the object X Y lives inside C, 
instead of ‘outside’, in the category of sets. 

Closed categories were introduced in 1966, by Eilenberg and Kelly 1591 . While these authors were able 
to define a closed structure for any category, it turns out that the internal hom is most easily understood for 
monoidal categories. The reason is that when our category has a tensor product, it is closed precisely when 
morphisms from X ® Y to Z are in natural one-to-one correspondence with morphisms from Y to X -o Z. In 
other words, it is closed when we have a natural isomorphism 

hom(X<g> F, Z) = hom(F, X -o Z) 

/ -> / 

For example, in the category Set, if we take X <8> Y to be the cartesian product X x F, then X -o Z is just the set 
of functions from X to Z, and we have a one-to-one correspondence between 

• functions / that eat elements ofXxf and spit out elements of Z 


and 


• functions / that eat elements of Y and spit out functions from X to Z. 

This correspondence goes as follows: 

f(x)(y) = f(x,y). 

Before considering other examples, we should make the definition of ‘closed monoidal category’ completely 
precise. For this we must note that for any category C, there is a functor 

hom: C op x C -> Set. 

Definition 13 The opposite category C op of a category C has the same objects as C, but a morphism f:x —» y 
in C op is a morphism f:y —> x in C, and the composite gf in C op is the composite fg in C. 

Definition 14 For any category C, the hom functor 

hom: C op x C -» Set 
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sends any object ( X , Y ) e C op X C to the set hom(X, Y), and sends any morphism (/, g) e C op X C to the function 


hom(/,g): hom(X, Y) —> hom(X', Y') 
h i—> ghf 


when f: X' —* X and g:Y —* Y' are morphisms in C. 

Definition 15 A monoidal category C is left closed if there is an internal hom functor 
-o: C op X C -» C 

together with a natural isomorphism c called currying that assigns to any objects X,Y,Z e C a bijection 
cry? : hom(X ®Y,Z) A hom (X,Y-oZ) 

It is right closed if there is an internal hom functor as above and a natural isomorphism 
Cx,y,z ■ hom (X®Y,Z) —> hom(K, X -o Z). 

The term ‘currying’ is mainly used in computer science, after the work of Curry El- In the rest of this section 
we only consider right closed monoidal categories. Luckily, there is no real difference between left and right 
closed for a braided monoidal category, as the braiding gives an isomorphism X®Y = Y®X. 

All our examples of monoidal categories are closed, but we shall see that, yet again. Set is different from the 
rest: 


• The cartesian category Set is closed, where X Y is just the set of functions from X to Y. In Set or any 
other cartesian closed category, the internal hom X -o Y is usually denoted Y x . To minimize the number 
of different notations and emphasize analogies between different contexts, we shall not do this: we shall 
always use X -o Y. To treat Set as left closed, we define the curried version of /: X x T —» Z as above: 

f(x)(y) = f(x,y). 

To treat it as right closed, we instead define it by 

f(y)(x) = fix, y). 

This looks a bit awkward, but it will be nice for string diagrams. 

• The symmetric monoidal category Hilb with its usual tensor product is closed, where X -o Y is the set 
of linear operators from X to Y, made into a Hilbert space in a standard way. In this case we have an 
isomorphism 

X -o Y = X* ® Y 

where X* is the dual of the Hilbert space X, that is, the set of linear operators /: X —> C, made into a 
Hilbert space in the usual way. 

• The monoidal category Tang*, (k > 1) is closed. As with Hilb, we have 

X -o Y = X* <g>Y 

where X* is the orientation-reversed version of X. 
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• The symmetric monoidal category nCob is also closed; again 
X -oY = X* ®Y 

where X* is the ( n - l)-manifold X with its orientation reversed. 

Except for Set, all these examples are actually ‘compact’. This basically means that X -o Y is isomorphic to 
X* ® Y, where X* is some object called the ‘dual’ of X. But to make this precise, we need to define the ‘dual’ of 
an object in an arbitrary monoidal category. 

To do this, let us generalize from the case of Hilb. As already mentioned, each object X e Hilb has a dual 
X* consisting of all linear operators /: X —» I, where the unit object I is just C. There is thus a linear operator 

<? x : -> / 
x®f h-» f{x) 

called the counit of X. Furthermore, the space of all linear operators from X to Y e Hilb can be identified with 
X* ® Y. So, there is also a linear operator called the unit of X: 

ix: I -» 

C C l x 

sending any complex number c to the corresponding multiple of the identity operator. 

The significance of the unit and counit become clearer if we borrow some ideas from Feynman. In physics, if 
X is the Hilbert space of internal states of some particle, X* is the Hilbert space for the corresponding antiparticle. 
Feynman realized that it is enlightening to think of antiparticles as particles going backwards in time. So, we 
draw a wire labelled by X* as a wire labelled by X , but with an arrow pointing ‘backwards in time’: that is, up 
instead of down: 

r| - 4 


(Here we should admit that most physicists use the opposite convention, where time marches up the page. Since 
we read from top to bottom, we prefer to let time run down the page.) 

If we draw X* as X going backwards in time, we can draw the unit as a cap: 


and the counit as a cup: 

In Feynman diagrams, these describe the creation and annihilation of virtual particle-antiparticle pairs! 
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It then turns out that the unit and counit satisfy two equations, the zig-zag equations: 



Verifying these is a fun exercise in linear algebra, which we leave to the reader. If we write these equations as 
commutative diagrams, we need to include some associators and unitors, and they become a bit intimidating: 


x®i —- x®(x*®x) aw - (X®X*)®X 



X - - - - I®X 


I®X* ix ® lx ► (X*®X)®X* X*®(X®X*) 

lx | | l x *Oex 

v* *-g|- X*® I 


But, they really just say that zig-zags in string diagrams can be straightened out. 


This is particularly vivid in examples like Tang,,, and nCob. For example, in 2Cob, taking X to be the circle, 
the unit looks like this: 


*0 1 
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while the counit looks like this: 

V* 

/ 

In this case, the zig-zag identities say we can straighten a wiggly piece of pipe. 
Now we are ready for some definitions: 


Definition 16 Given objects X* and X in a monoidal category, we call X* a right dual of X, and X a left dual 
of X*, if there are morphisms 

i x :I^X*®X 


and 


called the unit and counit respectively, satisfying the zig-zag equations. 


One can show that the left or right dual of an object is unique up to canonical isomorphism. So, we usually 
speak of ‘the’ right or left dual of an object, when it exists. 

Definition 17 A monoidal category C is compact if every object X e C has both a left dual and a right dual. 


Often the term ‘autonomous’ is used instead of ‘compact’ here. Many authors reserve the term ‘compact’ for 
the case where C is symmetric or at least braided; then left duals are the same as right duals, and things simplify 
l42l . To add to the confusion, compact symmetric monoidal categories are often called simply ‘compact closed 
categories’. 

A partial explanation for the last piece of terminology is that any compact monoidal category is automatically 
closed! For this, we define the internal hom on objects by 

X -o Y = X* ®Y. 

We must then show that the * operation extends naturally to a functor *: C —» C, so that -o is actually a functor. 
Finally, we must check that there is a natural isomorphism 

hom(X ®Y,Z) = hom( Y, X* ® Z) 

In terms of string diagrams, this isomorphism takes any morphism 
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and bends back the input wire labelled X to make it an output: 


X 



Now, in a compact monoidal category, we have: 


X 



Z 


But in general, closed monoidal categories don’t allow arrows pointing up! So for these, drawing the internal 
horn is more of a challenge. We can use the same style of notation as long as we add a decoration — a clasp — 
that binds two strings together: 

X^>Z 

Only when our closed monoidal category happens to be compact can we eliminate the clasp. 

Suppose we are working in a closed monoidal category. Since we draw a morphism f:X®Y —» Z like this: 




we can draw its curried version /: Y —» X -o Z by bending down the input wire labelled X to make it part of the 
output: 



Note that where we bent back the wire labelled X, a cap like this appeared: 

! n I 
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Closed monoidal categories don’t really have a cap unless they are compact. So, we drew a bubble enclosing / 
and the cap, to keep us from doing any illegal manipulations. In the compact case, both the bubble and the clasp 
are unnecessary, so we can draw / like this: 



An important special case of currying gives the name of a morphism f:X —> Y, 


r P: I —> X -o Y. 

This is obtained by currying the morphism 

fr x :I®X —» Y. 

In string diagrams, we draw r f' as follows: 


In the category Set, the unit object is the one-element set, 1. So, a morphism from this object to a set Q picks 
out a point of Q. In particular, the name r / 1 :1 —» X -o Y picks out the element of X -o Y corresponding to 
the function f :X —> Y. More generally, in any cartesian closed category the unit object is the terminal object 1, 
and a morphism from 1 to an object Q is called a point of Q. So, even in this case, we can say the name of a 
morphism /: X —» Y is a point of A -o Y. 

Something similar works for Hilb, though this example is compact rather than cartesian. In Hilb, the unit 
object I is just C. So, a nonzero morphism from I to any Hilbert space Q picks out a nonzero vector in Q, which 
we can normalize to obtain a state in Q: that is, a unit vector. In particular, the name of a nonzero morphism 
/: X —» Y gives a state of X* <g> Y. This method of encoding operators as states is the basis of ‘gate teleportation’ 
1471 . 

Currying is a bijection, so we can also uncurry: 

c~' YZ : hom (Y,X-oZ) -> hom (X®Y,Z) 

8 ^ 1 - 

Since we draw a morphism g: Y —» X -o Z like this: 
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we draw its ‘uncurried’ version g: X ® F —» Z by bending the output X up to become an input: 



Again, we must put a bubble around the ‘cup’ formed when we bend down the wire labelled F, unless we are in 
a compact monoidal category. 

A good example of uncurrying is the evaluation morphism: 

evx,y:X®(X^>F)^F 

This is obtained by uncurrying the identity 

W: (X -o F) —> (X -o F). 

In Set, evx,Y takes any function from X to F and evaluates it at any element of X to give an element of F. In 
terms of string diagrams, the evaluation morphism looks like this: 



In any closed monoidal category, we can recover a morphism from its name using evaluation. More precisely, 
this diagram commutes: 

X®I ■*——- X 

|/ 

X ® (X -o F) - F 
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Or, in terms of string diagrams: 



0 


We leave the proof of this as an exercise. In general, one must use the naturality of currying. In the special case 
of a compact monoidal category, there is a nice picture proof! Simply pop the bubbles and remove the clasp: 





The result then follows from one of the zig-zag identities. 

In our rapid introduction to string diagrams, we have not had time to illustrate how these diagrams become 
a powerful tool for solving concrete problems. So, here are some starting points for further study: 

• Representations of Lie groups play a fundamental role in quantum physics, especially gauge field theory. 
Every Lie group has a compact symmetric monoidal category of finite-dimensional representations. In his 
book Group Theory, Cvitanovic <361 develops detailed string diagram descriptions of these representation 
categories for the classical Lie groups SU(n), SO(n), SU(n) and also the more exotic ‘exceptional’ Lie 
groups. His book also illustrates how this technology can be used to simplify difficult calculations in 
gauge field theory. 

• Quantum groups are a generalization of groups which show up in 2d and 3d physics. The big difference 
is that a quantum group has compact braided monoidal category of finite-dimensional representations. 
Kauffman’s Knots and Physics ll59l is an excellent introduction to how quantum groups show up in knot 
theory and physics; it is packed with string diagrams. For more details on quantum groups and braided 
monoidal categories, see the book by Kassel 1581 . 
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• Kauffman and Lins ll60l have written a beautiful string diagram treatment of the category of represen¬ 
tations of the simplest quantum group, S U q ( 2). They also use it to construct some famous 3-manifold 
invariants associated to 3d and 4d topological quantum field theories: the Witten-Reshetikhin-Turaev, 
Turaev-Viro and Crane-Yetter invariants. In this example, string diagrams are often called ‘^-deformed 
spin networks’ El- For generalizations to other quantum groups, see the more advanced texts by Turaev 
E3 and by Bakalov and Kirillov fra . The key ingredient is a special class of compact braided monoidal 
categories called ‘modular tensor categories’. 

• Kock l64l has written a nice introduction to 2d topological quantum field theories which uses diagram¬ 
matic methods to work with 2Cob. 

• Abramsky, Coecke and collaborators [2 3 4 30 32 33| have developed string diagrams as a tool for 
understanding quantum computation. The easiest introduction is Coecke’s ‘Kindergarten quantum me¬ 
chanics’ EH- 

2.7 Dagger Categories 

Our discussion would be sadly incomplete without an important admission: nothing we have done so far with 
Hilbert spaces used the inner product! So, we have not yet touched on the essence of quantum theory. 

Everything we have said about Hilb applies equally well to Vect: the category of finite-dimensional vector 
spaces and linear operators. Both Hilb and Vect are compact symmetric monoidal categories. In fact, these 
compact symmetric monoidal categories are ‘equivalent’ in a certain precise sense 11721 . 

So, what makes Hilb different? In terms of category theory, the special thing is that we can take the Hilbert 
space adjoint of any linear operator f:X —> Y between finite-dimensional Hilbert spaces, getting an operator 
f: Y -> X. This ability to ‘reverse’ morphisms makes Hilb into a ‘dagger category’: 

Definition 18 A dagger category is a category C such that for any morphism f:X —> Y inC there is a specified 
morphism f^'.Y —» X such that 

(s/) f = fg r 

for every pair of composable morphisms f and g, and 

(/¥=/ 

for every morphism f. 

Equivalently, a dagger category is one equipped with a functor f: C —» C op that is the identity on objects and 
satisfies (f^f = f for every morphism. 

In fact, all our favorite examples of categories can be made into dagger categories, except for Set: 

• There is no way to make Set into a dagger category, since there is a function from the empty set to the 
1-element set, but none the other way around. 

• The category Hilb becomes a dagger category as follows. Given any morphism /: X —> Y in Hilb, there is 
a morphism/f; Y —> X, the Hilbert space adjoint of /, defined by 

</V> <t>) = (f, if) 

for all <p € X, if € Y. 
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For any k, the category Tang^ becomes a dagger category where we obtain /*: Y —> X by reflecting 
/: X —> Y in the vertical direction, and then switching the direction of the little arrows denoting the 
orientations of arcs and circles. 

For any n, the category nCob becomes a dagger category where we obtain / + : Y —> X by switching the 
input and output of /: X —> Y, and then switching the orientation of each connected component of /. 
Again, a picture speaks a thousand words: 


A i V 


Y 

t 

X 


In applications to physics, this dagger operation amounts to ‘switching the future and the past’. 


In all the dagger categories above, the dagger structure interacts in a nice way with the monoidal structure 
and also, when it exists, the braiding. One can write a list of axioms characterizing how this works 11511511371 . 
So, it seems that the ability to ‘reverse’ morphisms is another way in which categories of a quantum flavor differ 
from the category of sets and functions. This has important implications for the foundations of quantum theory 
tm and also for topological quantum field theory HE where dagger categories seem to be part of larger story 
involving ‘n-categories with duals’ 03l. However, this story is still poorly understood — there is much more 
work to be done. 


3 Logic 

3.1 Background 

Symmetric monoidal closed categories show up not only in physics and topology, but also in logic. We would 
like to explain how. To set the stage, it seems worthwhile to sketch a few ideas from 20th-century logic. 

Modern logicians study many systems of reasoning beside ordinary classical logic. Of course, even classical 
logic comes in various degrees of strength. First there is the ‘propositional calculus’, which allows us to reason 
with abstract propositions X, Y, Z,... and these logical connectives: 

and A 

or V 

implies => 

not -i 

true T 

false ± 

Then there is the ‘predicate calculus’, which also allows variables like x,y,z ,..., predicates like P(x) and 
Q(x, y, z ), and the symbols ‘for all’ (V) and ‘there exists’ (3), which allow us to quantify over variables. There are 
also higher-order systems that allow us to quantify over predicates, and so on. To keep things simple, we mainly 
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confine ourselves to the propositional calculus in what follows. But even here, there are many alternatives to the 
‘classical’ version! 

The most-studied of these alternative systems are weaker than classical logic: they make it harder or even 
impossible to prove things we normally take for granted. One reason is that some logicians deny that certain 
familiar principles are actually valid. But there are also subtler reasons. One is that studying systems with rules 
of lesser strength allows for a fine-grained study of precisely which methods of reasoning are needed to prove 
which results. Another reason — the one that concerns us most here — is that dropping familiar rules and then 
adding them back in one at at time sheds light on the connection between logic and category theory. 

For example, around 1907 Brouwer Il49l began advocating ‘intuitionism’. As part of this, he raised doubts 
about the law of excluded middle, which amounts to a rule saying that from -1-1Z we can deduce X. One problem 
with this principle is that proofs using it are not ‘constructive’. For example, we may prove by contradiction 
that some equation has a solution, but still have no clue how to construct the solution. For Brouwer, this meant 
the principle was invalid. 

Anyone who feels the law of excluded middle is invalid is duty-bound to study intuitionistic logic. But, 
there is another reason for studying this system. Namely: we do not really lose anything by dropping the law of 
excluded middle! Instead, we gain a fine-grained distinction: the distinction between a direct proof of X and a 
proof by contradiction, which yields merely -i->X. If we do not care about this distinction we are free to ignore 
it, but there is no harm in having it around. 

In the 1930’s, this idea was made precise by Godel Il45l and Gentzen 1931 . They showed that we can embed 
classical logic in intuitionistic logic. In fact, they found a map sending any formula X of the propositional 
calculus to a new formula X°, such that X is provable classically if and only if X° is provable intuitionistically. 
(More impressively, this map also works for the predicate calculus.) 

Later, yet another reason for being interested in intuitionistic logic became apparent: its connection to cate¬ 
gory theory. In its very simplest form, this connection works as follows. Suppose we have a set of propositions 
X, Y, Z,... obeying the laws of the intuitionistic propositional calculus. We can create a category C where these 
propositions are objects and there is at most one morphism from any object X to any object Y : a single morphism 
when X implies Y, and none otherwise! 

A category with at most one morphism from any object to any other is called a preorder. In the propositional 
calculus, we often treat two propositions as equal when they both imply each other. If we do this, we get a 
special sort of preorder: one where isomorphic objects are automatically equal. This special sort of preorder is 
called a partially ordered set, or poset for short. Posets abound in logic, precisely because they offer a simple 
framework for understanding implication. 

If we start from a set of propositions obeying the intuitionistic propositional calculus, the resulting category 
C is better than a mere poset. It is also cartesian, with lAfas the product of X and Y, and T as the terminal 
object! To see this, note that any proposition Q has a unique morphism to X A Y whenever it has morphisms to 
X and to Y. This is simply a fancy way of saying that Q implies ZAP when it implies X and implies Y. It is 
also easy to see that T is terminal: anything implies the truth. 

Even better, the category C is cartesian closed, with X => Y as the internal horn. The reason is that 
lA Y implies Z iff Y implies X => Z. 

This automatically yields the basic property of the internal horn: 

hom(X ®Y,Z) = hom(K X-o Z). 
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Indeed, if the reader is puzzled by the difference between ‘ X implies F’ and X => F, we can now explain this 
more clearly: the former involves the homset hom(X, Y) (which has one element when X implies Y and none 
otherwise), while the latter is the internal hom, an object in C. 

So, C is a cartesian closed poset. But, it also has one more nice property, thanks to the presence of V and ± 
We have seen that A and T make the category C cartesian; V and _L satisfy exactly analogous rules, but with the 
implications turned around, so they make C op cartesian. 

And that is all! In particular, negation gives nothing more, since we can define ->X to be X => _L, and 
all its intuitionistically valid properties then follow. So, the kind of category we get from the intuitionistic 
propositional calculus by taking propositions as objects and implications as morphisms is precisely a Heyting 
algebra: a cartesian closed poset C such that C op is also cartesian. 

Heyting, a student of Brouwer, introduced Heyting algebras in intuitionistic logic before categories were 
even invented. So, he used very different language to define them. But, the category-theoretic approach to 
Heyting algebras illustrates the connection between cartesian closed categories and logic. It also gives more 
evidence that dropping the law of excluded middle is an interesting thing to try. 

Since we have explained the basics of cartesian closed categories, but not said what happens when the 
opposite of such a category is also cartesian, in the sections to come we will take a drastic step and limit 
our discussion of logic even further. We will neglect V and ±, and concentrate only on the fragment of the 
propositional calculus involving A, T and =>. 

Even here, it turns out, there are interesting things to say — and interesting ways to modify the usual rules. 
This will be the main subject of the sections to come. But to set the stage, we need to say a bit about proof 
theory. 

Proof theory is the branch of mathematical logic that treats proofs as mathematical entities worthy of study 
in their own right. It lets us dig deeper into the propositional calculus by studying not merely whether or not 
some assumption X implies some conclusion Y, but the whole set of proofs leading from X to Y. This amounts 
to studying not just posets (or preorders), but categories that allow many morphisms from one object to another. 

In Hilbert’s approach to proof, there were many axioms and just one rule to deduce new theorems: modus 
ponens, which says that from X and ‘X implies F we can deduce Y. Most of modern proof theory focuses on 
another approach, the ‘sequent calculus’, due to Gentzen 11951 . In this approach there are few axioms but many 
inference rules. 

An excellent introduction to the sequent calculus is the book Proofs and Types by Girard, Lafont and Tay¬ 
lor, freely available online ll44l . Here we shall content ourselves with some sketchy remarks. A ‘sequent’ is 
something like this: 

X u ...,X m h Fi,...,T„ 

where A, and F, are propositions. We read this sequent as saying that all the propositions X h taken together, can 
be used to prove at least one of the propositions F,. This strange-sounding convention gives the sequent calculus 
a nice symmetry, as we shall soon see. 

In the sequent calculus, an ‘inference rule’ is something that produces new sequents from old. For example, 
here is the left weakening rule: 


X\,.. .,X m h F],., 

Aj,...,A m ,Ah Y u . 


., F„ 

.., F„ 
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This says that from the sequent above the line we can get the sequent below the line: we can throw in the 
extra assumption A without harm. Thanks to the strange-sounding convention we mentioned, this rule has a 
mirror-image version called right weakening: 


X u ...,X m h Y u ...,Y n 
X u ...,X m h Y\,..Y„,A 

In fact, Gentzen’s whole setup has this mirror symmetry! For example, his rule called left contraction: 

X u ...,X m ,A,Ab Y\,...,Y n 
X u ...,X m ,A\-Y u ...,Y n 

has a mirror partner called right contraction: 

X u ...,X m h Y lt ...,Y n ,A,A 
X u ...,X m h Y\,... ,Y n ,A 

Similarly, this rule for ‘and’ 

Xt,...,X m ,A h Y U ...,Y„ 

X\,... ,X m ,A A B \- Yi,... ,Y„ 

has a mirror partner for ‘or’: 

X u ...,X m h Y x ,...,Y n ,A 
X u ...,X m h Y u ...,Y n ,A V B 

Logicians now realize that this mirror symmetry can be understood in terms of the duality between a category 
and its opposite. 

Gentzen used sequents to write inference rules for the classical propositional calculus, and also the classical 
predicate calculus. Now, in these forms of logic we have 

X u ...,X m h Fi,...,T„ 


if and only if we have 


Xj A • • • A X m h Ti V • • • V Y n . 


So, why did Gentzen use sequents with a list of propositions on each side of the I- symbol, instead just a single 
proposition? The reason is that this let him use only inference rules having the ‘subformula property’. This says 
that every proposition in the sequent above the line appears as part of some proposition in the sequent below the 
line. So, a proof built from such inference rules becomes a ‘tree’ where all the propositions further up the tree 
are subformulas of those below. 

This idea has powerful consequences. For example, in 1936 Gentzen was able prove the consistency of 
Peano’s axioms of arithmetic! His proof essentially used induction on trees (Readers familiar with Godel’s 
second incompleteness theorem should be reassured that this sort of induction cannot itself be carried out in 
Peano arithmetic.) 

The most famous rule lacking the subformula property is the ‘cut rule’: 
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X u ...,X m h Y u ...,Y k ,A X m+u ...,X n ,A h Y k+ 1,...,Y( 
X u ...,X n h Y u ...,Y e 


From the two sequents on top, the cut rule gives us the sequent below. Note that the intermediate step A does not 
appear in the sequent below. It is ‘cut out’. So, the cut rule lacks the subformula property. But, one of Gentzen’s 
great achievements was to show that any proof in the classical propositional (or even predicate) calculus that 
can be done with the cut rule can also be done without it. This is called ‘cut elimination’. 

Gentzen also wrote down inference rules suitable for the intuitionistic propositional and predicate calculi. 
These rules lack the mirror symmetry of the classical case. But in the 1980s, this symmetry was restored by 
Girard’s invention of ‘linear logic’ m . 

Linear logic lets us keep track of how many times we use a given premise to reach a given conclusion. To 
accomplish this, Girard introduced some new logical connectives! For starters, he introduced ‘linear’ connec¬ 
tives called ® and -o, and a logical constant called I. These act a bit like A, => and T. However, they satisfy 
rules corresponding to a symmetric monoidal category instead of a cartesian closed category. In particular, from 
X we can prove neither X <g> X nor I. So, we cannot freely ‘duplicate’ and ‘delete’ propositions using these new 
connectives. This is reflected in the fact that linear logic drops Gentzen’s contraction and weakening rules. 

By itself, this might seem unbearably restrictive. However, Girard also kept the connectives A, => and T in 
his system, still satisfying the usual rules. And, he introduced an operation called the ‘exponential’, !, which 
takes a proposition X and turns it into an ‘arbitrary stock of copies of X’ . So, for example, from \X we can prove 
1, and X, and X ® X, and X ® X ® X, and so on. 

Full-fledged linear logic has even more connectives than we have described here. It seems baroque and 
peculiar at first glance. It also comes in both classical and intuitionistic versions! But Just as classical logic can 
be embedded in intuitionistic logic, intuitionistic logic can be embedded in intuitionistic linear logic E3). So, 
we do not lose any deductive power. Instead, we gain the ability to make even more fine-grained distinctions. 

In what follows, we discuss the fragment of intuitionistic linear logic involving only ®, -o and I. This is 
called ‘multiplicative intuititionistic linear logic’ ESIEJj. It turns out to be the system of logic suitable for 
closed symmetric monoidal categories — nothing more or less. 

3.2 Proofs as Morphisms 

In Section [2 we described categories with various amounts of extra structure, starting from categories pure and 
simple, and working our way up to monoidal categories, braided monoidal categories, symmetric monoidal 
categories, and so on. Our treatment only scratched the surface of an enormously rich taxonomy. In fact, each 
kind of category with extra structure corresponds to a system of logic with its own inference rules! 

To see this, we will think of propositions as objects in some category, and proofs as giving morphisms. 
Suppose X and Y are propositions. Then, we can think of a proof starting from the assumption X and leading to 
the conclusion Y as giving a morphism f:X —> Y. (In Section POl we shall see that a morphism is actually an 
equivalence class of proofs — but for now let us gloss over this issue.) 

Let us write X h Y when, starting from the assumption X , there is a proof leading to the conclusion Y. An 
inference rule is a way to get new proofs from old. For example, in almost every system of logic, if there is a 
proof leading from X to Y, and a proof leading from Y to Z, then there is a proof leading from X to Z. We write 
this inference rule as follows: 
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Xl-Y FhZ 
AhZ 

We can call this cut rule, since it lets us ‘cut out’ the intermediate step Y. It is a special case of Gentzen’s cut 
rule, mentioned in the previous section. It should remind us of composition of morphisms in a category: if we 
have a morphism f:X —» Y and a morphism g: Y —» Z, we get a morphism gf: X —» Z. 

Also, in almost every system of logic there is a proof leading from A to X. We can write this as an inference 
rule that starts with nothing and concludes the existence of a proof of X from X: 

Xl-X 

This rule should remind us of how every object in category has an identity morphism: for any object X, we 
automatically get a morphism 1^: X —» X. Indeed, this rule is sometimes called the identity rule. 

If we pursue this line of thought, we can take the definition of a closed symmetric monoidal category and 
extract a collection of inference rules. Each rule is a way to get new morphisms from old in a closed symmetric 
monoidal category. There are various superficially different but ultimately equivalent ways to list these rules. 
Here is one: 


X 


X § 


X_ 


Y YvZ 
Ah Z (o) 


W I- X ThZ w h (X®Y)®Z 

W ® Y h A ® Z m w h A ® ( Y 0 Z) 


A h 7® T Ahy®/ 

= (1) = (r) 

A h T A h T 


ffh A8f 
ffh Y ®X 


(b) 


Double lines mean that the inverse rule also holds. We have given each rule a name, written to the right in paren¬ 
theses. As already explained, rules (i) and (o) come from the presence of identity morphisms and composition 
in any category. Rules (®), (a), (1), and (r) come from tensoring, the associator, and the left and right unitors in 
a monoidal category. Rule (b) comes from the braiding in a braided monoidal category, and rule (c) comes from 
currying in a closed monoidal category. 

Now for the big question: what does all this mean in terms of logic? These rules describe a small fragment of 
the propositional calculus. To see this, we should read the connective ® as ‘and’, the connective -o as ‘implies’, 
and the proposition I as ‘true’. 

In this interpretation, rule (c) says we can turn a proof leading from the assumption ‘Y and A’ to the conclu¬ 
sion Z into a proof leading from A to ‘ Y implies Z’. It also says we can do the reverse. This is true in classical, 
intuitionistic and linear logic, and so are all the other rules. Rules (a) and (b) say that ‘and’ is associative and 
commutative. Rule (1) says that any proof leading from the assumption A to the conclusion ‘true and Y’ can be 
converted to a proof leading from A to Y, and vice versa. Rule (r) is similar. 

What do we do with these rules? We use them to build ‘deductions’. Here is an easy example: 

a -o y h a -o y ® 
a ® (a -o y) i- y (c ' 
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First we use the identity rule, and then the inverse of the currying rule. At the end, we obtain 
X ® (X -o F) h F. 

This should remind us of the evaluation morphisms we have in a closed monoidal category: 

ev z , y :X®(X^F)^F 

In terms of logic, the point is that we can prove F from X and ‘X implies F’. This fact comes in handy so often 
that we may wish to abbreviate the above deduction as an extra inference rule — a rule derived from our basic 
list: 


X ® (X -o F) h F (ev) 

This rule is called modus ponens. 

In general, a deduction is a tree built from inference rules. Branches arise when we use the (o) or (®) rules. 
Here is an example: 

(A ® B) ® C h (A ® B) ® C (I) 

(A®B)®Ch A®(B®C) W A®(B®C)\-D 

(A ® B) ® C h D (o) 

Again we can abbreviate this deduction as a derived rule. In fact, this rule is reversible: 

A®(B®C) h D 
(A <g> B) ® C h D {a) 

For a more substantial example, suppose we want to show 

(X -o F) ® (F -o Z) h X -o Z. 

The deduction leading to this will not even fit on the page unless we use our abbreviations: 

X ® (X -o F) h F <CT) F^ZhK^Z W 

(X® (X -«F))®(F-oZ)hF«(F-oZ) <8) F®(F^>Z)hZ ^ 

(X®(X^ F)) ® (F -o Z)\- Z 
X ® ((X -O F) ® (F -O Z)) h Z ( “ ) 

(X -O F) ® (F -O Z) h X -O Z (C) 

Since each of the rules used in this deduction came from a way to get new morphisms from old in a closed 
monoidal category (we never used the braiding), it follows that in every such category we have internal com¬ 
position morphisms: 

: (X -° F) ® (F -o Z) —> X -o Z. 

These play the same role for the internal horn that ordinary composition 

o: hom(X, F) X hom(F,Z) -» hom(X,Z) 

plays for the ordinary horn. 

We can go ahead making further deductions in this system of logic, but the really interesting thing is what it 
omits. For starters, it omits the connective ‘or’ and the proposition ‘false’. It also omits two inference rules we 
normally take for granted — namely, contraction: 
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Ih Y 


and weakening: 


X h Y , 

X\-I 

which are closely related to duplication and deletion in a cartesian category. Omitting these rules is a distinctive 
feature of linear logic E3 The word ‘linear’ should remind us of the category Hilb. As noted in Section 12.31 
this category with its usual tensor product is noncartesian, so it does not permit duplication and deletion. But, 
what does omitting these rules mean in terms of logic? 

Ordinary logic deals with propositions, so we have been thinking of the above system of logic in the same 
way. Linear logic deals not just with propositions, but also other resources — for example, physical things! 
Unlike propositions in ordinary logic, we typically can’t duplicate or delete these other resources. In classical 
logic, if we know that a proposition X is true, we can use X as many or as few times as we like when trying 
to prove some proposition Y. But if we have a cup of milk, we can’t use it to make cake and then use it again 
to make butter. Nor can we make it disappear without a trace: even if we pour it down the drain, it must go 
somewhere. 

In fact, these ideas are familiar in chemistry. Consider the following resources: 

H 2 = one molecule of hydrogen 
0 2 = one molecule of oxygen 

H2O = one molecule of water 

We can burn hydrogen, combining one molecule of oxygen with two of hydrogen to obtain two molecules of 
water. A category theorist might describe this reaction as a morphism: 

/: O2 9 (H 2 9 H 2 ) -> H 2 0 9 H 2 0. 


A linear logician might write: 

0 2 ®(H 2 ®H 2 )m 2 0®H 2 0 

to indicate the existence of such a morphism. But, we cannot duplicate or delete molecules, so for example 
H 2 \/H 2 ®H 2 


and 

H 2 \/I 

where I is the unit for the tensor product: not iodine, but ‘no molecules at all’. 

In short, ordinary chemical reactions are morphisms in a symmetric monoidal category where objects are 
collections of molecules. As chemists normally conceive of it, this category is not closed. So, it obeys an even 
more limited system of logic than the one we have been discussing, a system lacking the connective To get 
a closed category — in fact a compact one — we need to remember one of the great discoveries of 20th-century 
physics: antimatter. This lets us define Y -o Z to be ‘anti-T andZ’: 

Y -oZ=Y* ®Z. 


Then the currying rule holds: 
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Y®XvZ 


Xv Y* ®Z 

Most chemists don’t think about antimatter very often — but particle physicists do. They don’t use the notation 
of linear logic or category theory, but they know perfectly well that since a neutrino and a neutron can collide 
and turn into a proton and an electron: 

then a neutron can turn into a antineutrino together with a proton and an electron: 

nv v*®(p®e). 


This is an instance of the currying rule, rule (c). 

3.3 Logical Theories from Categories 

We have sketched how different systems of logic naturally arise from different types of categories. To illustrate 
this idea, we introduced a system of logic with inference rules coming from ways to get new morphisms from 
old in a closed symmetric monoidal category. One could substitute many other types of categories here, and get 
other systems of logic. 

To tighten the connection between proof theory and category theory, we shall now describe a recipe to get 
a logical theory from any closed symmetric monoidal category. For this, we shall now use X h Y to denote the 
set of proofs — or actually, equivalence classes of proofs — leading from the assumption X to the conclusion 
Y. This is a change of viewpoint. Previously we would write X v Y when this set of proofs was nonempty; 
otherwise we would write X \/ F. The advantage of treating X v Y as a set is that this set is precisely what a 
category theorist would call hom(X, Y): a homset in a category. 

If we let X v Y stand for a homset, an inference rule becomes a function from a product of homsets to a 
single homset. For example, the cut rule 


Xv Y Y v Z 
Xv Z 

becomes another way of talking about the composition function 

o xxz : hom(X, F) X hom(F, Z) -> hom(X, Z), 

while the identity rule 


XU 

becomes another way of talking about the function 

i x : 1 —> homiX, X) 

that sends the single element of the set 1 to the identity morphism of X. (Note: the set 1 is a zero-fold product 
of homsets.) 

Next, if we let inference rules be certain functions from products of homsets to homsets, deductions become 
more complicated functions of the same sort built from these basic ones. For example, this deduction: 
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(X®I)®Y\-X® Y 


specifies a function from 1 to hom((X ®I)®Y,X® 7), built from the basic functions indicated by the labels at 
each step. This deduction: 


_ m I ®Y I®Y ® 

(X®I)®Yh(X®I)®Y I ®Y \- Y W XhX ® 

(X®I)®Y Y X®(I®Y) 3 X®{I®Y)yX®Y m 

(X®I)®Y\-X®Y ( °’ 


gives another function from 1 to hom((X ®I)®Y,X® Y). 

If we think of deductions as giving functions this way, the question arises when two such functions are equal. 
In the example just mentioned, the triangle equation in the definition of monoidal category (Definition 0: 


(X® I)® Y -—-- X® (I ®Y) 


X®Y 

says these two functions are equal. Indeed, the triangle equation is precisely the statement that these two 
functions agree! (We leave this as an exercise for the reader.) 

So: even though two deductions may look quite different, they may give the same function from a product 
of homsets to a homset if we demand that these are homsets in a closed symmetric monoidal category. This is 
why we think of X -o Y as a set of equivalence classes of proofs, rather than proofs: it is forced on us by our 
desire to use category theory. We could get around this by using a 2-category with proofs as morphisms and 
‘equivalences between proofs’ as 2-morphisms 1831 . This would lead us further to the right in the Periodic Table 
(Tabled. But let us restrain ourselves and make some definitions formalizing what we have done so far. 

From now on we shall call the objects X, Y,... ‘propositions’, even though we have seen they may represent 
more general resources. Also, purely for the sake of brevity, we use the term ‘proof’ to mean ‘equivalence class 
of proofs’. The equivalence relation must be coarse enough to make the equations in the following definitions 
hold: 

Definition 19 A closed monoidal theory consists of the following: 

• A collection o/propositions. The collection must contain a proposition I, and ifX and Y are propositions, 
then so areX®Y andX Y. 

• For every pair of propositions X, Y,asetX h Y o/proofs leading from X to Y. Iff e X h Y, then we write 
f:X —*Y. 

• Certain functions, written as inference rules: 
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These are just the usual definitions of various kinds of closed category — monoidal, braided monoidal and 
symmetric monoidal — written in a new style. This new style lets us build such categories from logical systems. 
To do this, we take the objects to be propositions and the morphisms to be equivalence classes of proofs, where 
the equivalence relation is generated by the equations listed in the definitions above. 

However, the full advantages of this style only appear when we dig deeper into proof theory, and generalize 
the expressions we have been considering: 

X\-Y 

to ‘sequents’ like this: 

Xj,.. ,,X„ h Y. 

Loosely, we can think of such a sequent as meaning 

Xj ® • • • <g> X n h Y. 
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The advantage of sequents is that they let us use inference rules that — except for the cut rule and the identity 
rule — have the ‘subformula property’ mentioned near the end of Section ITT! 

Formulated in terms of these inference rules, the logic of closed symmetric monoidal categories goes by the 
name of ‘multiplicative intuitionistic linear logic’, or MILL for short H48II82I . There is a ‘cut elimination’ theo¬ 
rem for MILL, which says that with a suitable choice of other inference rules, the cut rule becomes redundant: 
any proof that can be done with it can be done without it. This is remarkable, since the cut rule corresponds 
to composition ofmorphisms in a category. One consequence is that in the free symmetric monoidal closed 
category on any set of objects, the set of morphisms between any two objects is finite. There is also a decision 
procedure to tell when two morphisms are equal. For details, see Trimble’s thesis EH and the papers by Jay El 
and Soloviev l90l . Also see Kelly and Mac Lane’s coherence theorem for closed symmetric monoidal categories 
ED, and the related theorem for compact symmetric monoidal categories lf62l . 

MILL is just one of many closely related systems of logic. Most include extra features, but some subtract 
features. Here are just a few examples: 

• Algebraic theories. In his famous thesis, Lawvere 1691 defined an algebraic theory to be a cartesian 
category where every object is an n-fold cartesian power X x • • • x X (n > 0) of a specific object X. He 
showed how such categories regarded as logical theories of a simple sort — the sort that had previously 
been studied in ‘universal algebra’ (25 1. This work initiated the categorical approach to logic which we 
have been sketching here. Crole’s book 01 gives a gentle introduction to algebraic theories as well as 
some richer logical systems. More generally, we can think of any cartesian category as a generalized 
algebraic theory. 

• Intuitionistic linear logic (ILL). ILL supplements MILL with the operations familiar from intuitionistic 
logic, as well as an operation ! turning any proposition (or resource) X into an ‘indefinite stock of copies of 
X’. Again there is a nice category-theoretic interpretation. Bierman’s thesis ED gives a good overview, 
including a proof of cut elimination for ILL and a proof of the result, originally due to Girard, that 
intuitionistic logic can be be embedded in ILL. 

• Linear logic (LL). For full-fledged linear logic, the online review article by Di Cosmo and Miller 071 is a 
good place to start. For more, try the original paper by Girard 1431 and the book by Troelstra 11968 . Blute 
and Scott’s review article 1:24) serves as a Rosetta Stone for linear logic and category theory, and so do the 
lectures notes by Schalk (82). 

• Intuitionistic Logic (IL). Lambek and Scott’s classic book ED is still an excellent introduction to intu¬ 
itionistic logic and cartesian closed categories. The online review article by Moschovakis ED contains 
many suggestions for further reading. 

To conclude, let us say precisely what an ‘inference rule’ amounts to in the setup we have described. We 
have said it gives a function from a product of homsets to a homset. While true, this is not the last word on the 
subject. After all, instead of treating the propositions appearing in an inference rule as fixed, we can treat them 
as variable. Then an inference rule is really a ‘schema’ for getting new proofs from old. How do we formalize 
this idea? 

First we must realize that X h Y is not just a set: it is a set depending in a junctorial way on X and Y. As 
noted in Definition [21 there is a functor, the ‘horn functor’ 

hom:C op xC-» Set, 


46 




sending ( X , 7) to the homset hom(X, Y) = X h F. To look like logicians, let us write this functor as K 

Viewed in this light, most of our inference rules are natural transformations. For example, rule (a) is 
natural transformation between two functors from C op X C 3 to Set, namely the functors 

(W, X, Y, Z) 


and 


{W,X,Y,Z)^> W \-X®{Y®Z). 


This natural transformation turns any proof 


f:W 


( X®Y)®Z 


into the proof 

axjzf:W^>X®(Y®Z). 

The fact that this transformation is natural means that it changes in a systematic way as we vary W, X, Y and Z. 
The commuting square in the definition of natural transformation, Definition^ makes this precise. 

Rules (1), (r), (b) and (c) give natural transformations in a very similar way. The (®) rule gives a natural 
transformation between two functors from C op x C x C op x C to Set, namely 

(W,X, Y,Z) i-> (W h X) X (F h Z) 


and 


{W,X,Y,Z)^> W ®Y V X®Z. 


This natural transformation sends any element (/, g ) e hom( W, X) X hom(F, Z) to / ® g. 

The identity and cut rules are different: they do not give natural transformations, because the top line of 
these rules has a different number of variables than the bottom line! Rule (i) says that for each X e C there is a 
function 

ix: 1 X h X 

picking out the identity morphism l*. What would it mean for this to be natural in XI Rule (o) says that for 
each triple X,Y,Z e C there is a function 


°: (X h F) X (F h Z) ^ X\-Z. 

What would it mean for this to be natural in X, Y and Z? The answer to both questions involves a generalization 
of natural transformations called ‘dinatural’ transformations 17 1 1 . 

As noted in Definition!?] a natural transformation a:F => G between two functors F,G:C —> D makes 
certain squares in D commute. If in fact C = C° p X C2, then we actually obtain commuting cubes in D. Namely, 
the natural transformation a assigns to each object (X I ,X2) a morphism ax,,x 2 such that for any morphism 
(f\: Y\ —> X\, f2'. X2 —» F2) in C, this cube commutes: 
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Gilr.Ji) 

G(Y u X 2 ) ---- G(Y 1 ,Y 2 ) 



If Ci = C 2 , we can choose a single object X and a single morphism /: X —> 7 and use it in both slots. As 
shown in Figure^ there are then two paths from one comer of the cube to the antipodal comer that only involve 
a for repeated arguments: that is, ax,x and ayy, but not ayy or ayx- These paths give a commuting hexagon. 

This motivates the following: 

Definition 22 A dinatural transformation a:F => G between functors F, G: C op X C — » D assigns to every 
object X in C a morphism ax'. F(X, X) —> G(X, X) in D such that for every morphism f\X^> Y in C, the hexagon 
in Figure\I\cotmnutes. 

In the case of the identity rule, this commuting hexagon follows from the fact that the identity morphism is a 
left and right unit for composition: see Figure^] For the cut rule, this commuting hexagon says that composition 
is associative: see Figure^ 

So, in general, the sort of logical theory we are discussing involves: 

• A category C of propositions and proofs. 

• A functor h: C op X C —> Set sending any pair of propositions to the set of proofs leading from one to the 
other. 


• A set of dinatural transformations describing inference rules. 












G(Y,X) 


G(Y, Y ) 



F(X, X) F{X,Y) 


Figure 1: A natural transformation between functors F, G: C op xC^D gives a commuting cube in D for any 
morphism /: X —> Y, and there are two paths around the cube that only involve a for repeated arguments. 



Figure 2: Dinaturality of the (i) rule, where /: X —> Y. Here • e 1 denotes the one element of the one-element 
set. 
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X\-Z 

ho(fog) 


(A h 17) x (7i- Z) _ (/°-,1 fhz) (X h 7) x (7 h Z) 

(g, &) (/ ° g, h) 


X\-Z 

(hof)og 


XvZ 

(ho f)og = ho(fog) 


(X h IT) x 07 h Z) 
(g,W) 


Figure 3: Dinaturality of the cut rule, where f:W —» 7, g: X —> 17, h: Y —> Z. 

4 Computation 

4.1 Background 

In the 1930s, while Turing was developing what are now called ‘Turing machines’ as a model for computation, 
Church and his student Kleene were developing a different model, called the ‘lambda calculus’ 1231163). While 
a Turing machine can be seen as an idealized, simplified model of computer hardware, the lambda calculus is 
more like a simple model of software. 

By now the are many careful treatments of the lambda calculus in the literature, from Barendregt’s magis¬ 
terial tome ca to the classic category-theoretic treatment of Lambek and Scott Il67) . to Hindley and Seldin’s 
user-friendly introduction m and Selinger’s elegant free online notes (86). So, we shall content ourselves with 
a quick sketch. 

Poetically speaking, the lambda calculus describes a universe where everything is a program and everything 
is data: programs are data. More prosaically, everything is a ‘/l-term’, or ‘term’ for short. These are defined 
inductively: 

• Variables: there is a countable set of ‘variables’ x,y,z,... which are all terms. 

• Application: if / and t are terms, we can ‘apply’ / to t and obtain a term /(f). 

• Lambda-abstraction: if x is a variable and t is a term, there is a term (Ax.t). 
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Let us explain the meaning of application and lambda-abstraction. Application is simple. Since ‘programs 
are data’, we can think of any term either as a program or a piece of data. Since we can apply programs to data 
and get new data, we can apply any term / to any other term t and get a new term fit). 

Lambda-abstraction is more interesting. We think of (Ax.t) as the program that, given x as input, returns t as 
output. For example, consider 

(Ax.x(x)). 

This program takes any program x as input and returns x(x) as output. In other words, it applies any program to 
itself. So, we have 

(Ax.x(x))(s) = s(s) 

for any term s. 

More generally, if we apply (Ax.t) to any term ,y, we should get back t, but with s substituted for each free 
occurrence of the variable x. This fact is codified in a rule called beta reduction: 

(Ax.t)(s) = t[s/x\ 

where t[s/x\ is the term we get by taking t and substituting s for each free occurrence of x. But beware: this 
rule is not an equation in the usual mathematical sense. Instead, it is a ‘rewrite rule’: given the term on the left, 
we are allowed to rewrite it and get the term on the right. Starting with a term and repeatedly applying rewrite 
rules is how we take a program and let it run! 

There are two other rewrite rules in the lambda calculus. If x is a variable and t is a term, the term 
(Ax.t(x)) 

stands for the program that, given x as input, returns t(x) as output. But this is just a fancy way of talking about 
the program t. So, the lambda calculus has a rewrite rule called eta reduction, saying 

(Ax.t(x) | = t. 

The third rewrite rule is alpha conversion. This allows us to replace a bound variable in a term by another 
variable. For example: 

(Ax.x(x)) = (Ay.y(y)) 

since x is ‘bound’ in the left-hand expression by its appearance in ‘Ax’. In other words, x is just a dummy 
variable; its name is irrelevant, so we can replace it with y. On the other hand, 

(Ax.y(x)) * (Ax.ztx)). 

We cannot replace the variable y by the variable z here, since this variable is ‘free’, not bound. Some care must 
be taken to make the notions of free and bound variables precise, but we shall gloss over this issue, referring the 
reader to the references above for details. 

The lambda calculus is a very simple formalism. Amazingly, starting from just this, Church and Kleene 
were able to build up Boolean logic, the natural numbers, the usual operations of arithmetic, and so on. For 
example, they defined ‘Church numerals’ as follows: 

0 = (Af.(Ax.x)) 

1 = (Af.(Ax.f(x))) 

2 = (Af.(Ax.f(f(x)))) 

3 = (Af.(Ax.f(f(f(x)m 
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and so on. Note that / is a variable above. Thus, the Church numeral n is the program that ‘takes any program 
to the nth power’: if you give it any program / as input, it returns the program that applies / n times to whatever 
input x it receives. 

To get a feeling for how we can define arithmetic operations on Church numerals, consider 
Ag3(2(g)). 

This program takes any program g, squares it, and then cubes the result. So, it raises g to the sixth power. This 
suggests that 

Ag.3(2{g)) = 6. 

Indeed this is true. If we treat the definitions of Church numerals as reversible rewrite rules, then we can start 
with the left side of the above equation and grind away using rewrite rules until we reach the right side: 


(Ag3(2(g)) = {Ag3_{{AfAAx.f(f(x))))){g)) def.of2 

= (Ag3(Ax.g(g(x)))) beta 

= (Ag.(Af.(A X .f(f(f(x))MA X .g(g(xm def. of 3 

= (dg.(dx.(/lx.g(g(x)))((dx.g(g(x)))((/Ix.g(g(x)))(x))))) beta 

= (dg.(Ax.(dx.g(g(x)))((Tg.g(g(x)))(g(g(x)))))) beta 

= (Ag.(Ax.(Ax.g(g(x)))(g(g(g(g(x))))))) beta 

= (_Ag.(Ax.g(g(g(g(g(g(x)))))))) beta 

= 6 def. of 6 


If this calculation seems mind-numbing, that is precisely the point: it resembles the inner workings of a com¬ 
puter. We see here how the lambda calculus can serve as a programming language, with each step of computation 
corresponding to a rewrite rule. 

Of course, we got the answer 6 because 3x2 = 6. Generalizing from this example, we can define a program 
called ‘times’ that multiplies Church numerals: 

times = ( Aa.{Ab.{Ax.a{b{x ))))). 


For example, 


times(3)(2) = 6. 


The enterprising reader can dream up similar programs for the other basic operations of arithmetic. With more 
cleverness, Church and Kleene were able to write terms corresponding to more complicated functions. They 
eventually came to believe that all computable functions /: N —» N can be defined in the lambda calculus. 

Meanwhile, Godel was developing another approach to computability, the theory of ‘recursive functions’. 
Around 1936, Kleene proved that the functions definable in the lambda calculus were the same as Godel’s 
recursive functions. In 1937 Turing described his ‘Turing machines’, and used these to give yet another definition 
of computable functions. This definition was later shown to agree with the other two. Thanks to this and other 
evidence, it is now widely accepted that the lambda calculus can define any function that can be computed by 
any systematic method. We say it is ‘Turing complete’. 

After this burst of theoretical work, it took a few decades for programmable computers to actually be built. 
It took even longer for computer scientists to profit from Church and Kleene’s insights. This began around 
1958, when McCarthy invented the programming language Lisp, based on the lambda calculus ED- In 1965, an 
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influential paper by Landin ll68l pointed out a powerful analogy between the lambda calculus and the language 
ALGOL. These developments led to a renewed interest in the lambda calculus which continues to this day. 
By now, a number of computer languages are explicitly based on ideas from the lambda calculus. The most 
famous of these include Lisp, ML and Haskell. These languages, called ‘functional programming languages’, 
are beloved by theoretical computer scientists for their conceptual clarity. In fact, for many years, everyone 
majoring in computer science at MIT has been required to take an introductory course that involves programming 
in Scheme, a dialect of Lisp. The cover of the textbook for this course m even has a big A on the cover! 

We should admit that languages of a different sort — ‘imperative programming languages’ — are more pop¬ 
ular among working programmers. Examples include FORTRAN, BASIC, and C. In imperative programming, a 
program is a series of instructions that tell the computer what to do. By constrast, in functional programming, a 
program simply describes a function. To run the program, we apply it to an input. So, as in the lambda calculus, 
‘application’ is a fundamental operation in functional programming. If we combine application with lambda 
abstraction, we obtain a language powerful enough to compute any computable function. 

However, most functional programming languages are more regimented than the original lambda calculus. 
As we have seen, in the lambda calculus as originally developed by Church and Kleene, any term can be applied 
to any other. In real life, programming involves many kinds of data. For example, suppose we are writing a 
program that involves days of the week. It would not make sense to write 

times(3)(Tuesday) 

because Tuesday is not a number. We might choose to represent Tuesday by a number in some program, but 
doubling that number doesn’t have a good interpretation: is the first day of the week Sunday or Monday? Is the 
week indexed from zero or one? These are arbitrary choices that affect the result. We could let the programmer 
make the choices, but the resulting unstructured framework easily leads to mistakes. 

It is better to treat data as coming in various ‘types’, such as integers, floating-point numbers, alphanumeric 
strings, and so on. Thus, whenever we introduce a variable in a program, we should make a ‘type declaration’ 
saying what type it is. For example, we might write: 

Tuesday : day 

This notation is used in Ada, Pascal and some other languages. Other notations are also in widespread use. 
Then, our system should have a ‘type checker’ (usually part of the compiler) that complains if we try to apply a 
program to a piece of data of the wrong type. 

Mathematically, this idea is formalized by a more sophisticated version of the lambda calculus: the ‘typed’ 
lambda calculus, where every term has a type. This idea is also fundamental to category theory, where every 
morphism is like a black box with input and output wires of specified types: 



and it makes no sense to hook two black boxes together unless the output of the first has the same type as the 
input of the next: 
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Indeed, there is a deep relation between the typed lambda calculus and cartesian closed categories. This was 
discovered by Lambek in 1980 ll66l . Quite roughly speaking, a ‘typed lambda-theory’ is a very simple functional 
programming language with a specified collection of basic data types from which other more complicated types 
can be built, and a specified collection of basic terms from which more complicated terms can be built. The data 
types of this language are objects in a cartesian closed category, while the programs — that is, terms — give 
morphisms! 

Here we are being a bit sloppy. Recall from Section POI that in logic we can build closed monoidal categories 
where the morphisms are equivalence classes of proofs. We need to take equivalence classes for the axioms of 
a closed monoidal category to hold. Similarly, to get closed monoidal categories from computer science, we 
need the morphisms to be equivalence classes of terms. Two terms count as equivalent if they differ by rewrite 
rules such as beta reduction, eta reduction and alpha conversion. As we have seen, these rewrites represent the 
steps whereby a program carries out its computation. For example, in the original ‘untyped’ lambda calculus, 
the terms times(3)(2) and 6 differ by rewrite rules, but they give the same morphism. So, when we construct a 
cartesian closed category from a typed lambda-theory, we neglect the actual process of computation. To remedy 
this we should work with a cartesian closed 2-category which has: 

• types as objects, 

• terms as morphisms, 

• equivalence classes of rewrites as 2-morphisms. 

For details, see the work of Seely ll84l . Hilken EH, and Mellies E3- Someday this work will be part of the 
larger n-categorical Rosetta Stone mentioned at the end of Section 1231 

In any event, Lambek showed that every typed lambda-theory gives a cartesian closed category — and 
conversely, every cartesian closed category gives a typed lambda-theory. This discovery led to a rich line of 
research blending category theory and computer science. There is no way we can summarize the resulting 
enormous body of work, though it constitutes a crucial aspect of the Rosetta Stone. Two good starting points for 
further reading are the textbook by Crole lt34i and the online review article by Scott ll80l . 

In what follows, our goal is more limited. First, in Section 1431 we explain how every ‘typed lambda-theory’ 
gives a cartesian closed category, and conversely. We follow the treatment of Lambek and Scott < 571 . in a 
somewhat simplified form. Then, in Section POl we describe how every ‘linear type theory’ gives a closed 
symmetric monoidal category, and conversely. 

The idea here is roughly that a ‘linear type theory’ is a programming language suitable for both classical and 
quantum computation. This language differs from the typed lambda calculus in that it forbids duplication and 
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deletion of data except when expressly permitted. The reason is that while every object in a cartesian category 
comes equipped with ‘duplication’ and ‘deletion’ morphisms: 

Ax:X->X<g>X, ! Z :X->1, 

a symmetric monoidal category typically lacks these. As we saw in Section 1231 a great example is the category 
Hilb with its usual tensor product. So, a programming language suitable for quantum computation should not 
assume we can duplicate all types of data ED Illini- 

Various versions of ‘quantum’ or ‘linear’ lambda calculus have already been studied, for example by Benton, 
Bierman de Paiva and Hyland ED. Dorca and van Tonder and Selinger and Valiron 1881 . Abramsky and 
Tzevelekos sketch a version in their paper in this volume 0 . We instead explain the ‘linear type theories’ 
developed by Simon Ambler in his 1991 thesis 0 . 

4.2 The Typed Lambda Calculus 

Like the original ‘untyped’ lambda calculus explained above, the typed lambda calculus uses terms to represent 
both programs and data. However, now every term has a specific type. A program that inputs data of type X and 
outputs data of type Y is said to be of type X -o Y. So, we can only apply a term s to a term t of type X if s is of 
type X -o Y for some Y. In this case s(t) is a well-defined term of type Y. We call I^fa function type. 

Whenever we introduce a variable, we must declare its type. We write t : X to mean that t is a term of type X. 
So, in lambda abstraction, we no longer simply write expressions like (Ax. t). Instead, if x is a variable of type 
X, we write 

(Ax:X. t). 

For example, here is a simple program that takes a program of type X -o X and ‘squares’ it: 

(Af:X^>X.(Ax:X. /(/(*)))). 

In the original lambda calculus, all programs take a single piece of data as input. In other words, they 
compute unary functions. This is no real limitation, since we can handle functions that take more than one 
argument using a trick called ‘currying’, discussed in Section l2~6l This turns a function of several arguments into 
a function that takes the first argument and returns a function of the remaining arguments. We saw an example in 
the last section: the program ‘times’. For example, times(3) is a program that multiplies by 3, so times(3)(2) = 6. 

While making all programs compute unary functions is economical, it is not very kind to the programmer. 
So, in the typed lambda calculus we also introduce products: given types X and Y, there is a type X x Y called a 
product type. We can think of a datum of type X x Y as a pair consisting of a datum of type X and a datum of 
type Y. To make this intuition explicit, we insist that given terms s : X and t: Y there is a term (,v, t): XxY. We 
also insist that given a term u : XxY there are terms p(u ) : X and p'(u) : Y, which we think of as the first and 
second components of the pair t. We also include rewrite rules saying: 

( p(u ), p'(u)) = u for all u : X x Y, 

p(s, t) - s for all s : X and t : Y, 
p'(s,t ) = t for all s :X and t :Y. 

Product types allow us to write programs that take more than one input. Even more importantly, they let 
us deal with programs that produce more than one output. For example, we might have a type called ‘integer’. 
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Then we might want a program that takes an integer and duplicates it: 


duplicate : integer -o (integer x integer) 


Such a program is easy to write: 

duplicate = (Ax :integer . (x,x)). 

Of course this a program we should not be allowed to write when duplicating information is forbidden, but in 
this section our considerations are all ‘classical’, i.e., suited to cartesian closed categories. 

The typed lambda calculus also has a special type called the ‘unit type’, which we denote as 1. There is 
a single term of this type, which we denote as (). From the viewpoint of category theory, the need for this 
type is clear: a category with finite products must have not only binary products but also a terminal object (see 
DetinitionlTni. For example, in the category Set, the terminal object can be taken as any one-element set, and () 
is the unique element of this set. It may be less clear why this type is useful in programming. One reason is that 
it lets us think of a constant of type X as a function of type 1 -o X — that is, a ‘nullary’ function, one that takes 
no arguments. There are some other reasons, but they go beyond the scope of this discussion. Suffice it to say 
that Haskell, Lisp and even widely used imperative languages such as C, C++ and Java include the unit type. 

Having introduced the main ingredients of the typed lambda calculus, let us give a more formal treatment. 
As we shall see, a ‘typed lambda-theory’ consists of types, terms and rewrite rules. From a typed lambda-theory 
we can get a cartesian closed category. The types will give objects, the terms will give morphisms, and the 
rewrite rules will give equations between morphisms. 

First, the types are given inductively as follows: 

• Basic types: There is an arbitarily chosen set of types called basic types. 

• Product types: Given types X and Y, there is a type X x Y. 

• Function types: Given types X and Y, there is a type X -o Y. 

• Unit type: There is a type 1. 

There may be unexpected equations between types: for example we may have a type X satisfying XxX = X. 
However, we demand that: 

• If X = X' and Y = Y' then XxY = X' xY'. 

• If X = X' and Y = Y' then X -o Y = X' -o Y'. 

Next we define terms. Each term has a specific type, and if t is a term of type X we write t:X. The rules for 
building terms are as follows: 

• Basic terms: For each type X there is a set of basic terms of type X. 

• Variables: For each type X there is a countably infinite collection of terms of type X called variables of 
type X. 

• Application: If / : X -o Y and t: X then there is a term f(t) of type T. 
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• Lambda abstraction: If x is a variable of type X and t : Y then there is a term (Ax:X. t) of type X -o Y. 

• Pairing: If s : X and t: Y then there is a term (s, t) of type X x Y. 

• Projection: If t: X x X' then there is a term pit) of X and a term p'(t) of type X'. 

• Unit term: There is a term () of type 1. 

Finally there are rewrite rules going between terms of the same type. Given any fixed set of variables S, 
there will be rewrite rules between terms of the same type, all of whose free variables lie in the set S. For our 
present purposes, we only need these rewrite rules to decide when two terms determine the same morphism in 
the cartesian closed category we shall build. So, what matters is not really the rewrite rules themselves, but the 
equivalence relation they generate. We write this equivalence relation as s ~s t. 

The relation ~s can be any equivalence relation satisfying the following list of rules. In what follows, t[s/x\ 
denotes the result of taking a term t and replacing every free occurence of the variable x by the term s. Also, 
when when we say ‘term’ without further qualification, we mean ‘term all of whose free variables lie in the set 
S\ 


• Type preservation: If t f then t and r' must be terms of the same type, all of whose free variables lie 
in the set S. 

• Beta reduction: Suppose x is a variable of type X, s is a term of type X, and t is any term. If no free 
occurrence of a variable in s becomes bound in t[s/x\, then: 

(Ax:X ,t)(s) ~s t[s/x\. 

• Eta reduction: Suppose the variable x does not appear in the term /. Then: 

(Ax:X.f(x)) ~ s f. 

• Alpha conversion: Suppose x and y are variables of type X, and no free occurrence of any variable in t 
becomes bound in f[x/y]. Then: 

(Ax:X.t) ~s (Ty:X.t[x/y]). 

• Application: Suppose t and t' are terms of type X with t ~s f, and suppose that /: A -o Y. Then: 

m ~s 

• Lambda abstraction: Suppose t and f are terms of type Y, all of whose free variables lie in the set 
S U {x}. Suppose that t -sum ? ■ Then: 


(Ax:X .t) ~s (Ax:X.f) 
• Pairing: If u is a term of type X x F then: 


(p(u),p'(u)) ~s U. 
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• Projection: if s is a term of type X and t is a term of type Y then: 

p(s,t) ~s s 
p'(s,t) ~s t. 

• Unit term: If t is a term of type 1 then: 

t ~s ()• 

Now we can describe Lambek’s classic result relating typed lambda-theories to cartesian closed categories. 
From a typed lambda-theory we get a cartesian closed category C for which: 

• The objects of C are the types. 

• The morphisms /: X —» Y of C are equivalence classes of pairs (x, t ) consisting of a variable x: X and a 
term t : Y with no free variables except perhaps x. Here (jc, f) is equivalent to (x', f) if and only if: 

t ~{4 *'[*/*']• 

• Given a morphism /: X —> T coming from a pair (x, t) and a morphism g: Y —> Z coming from a pair (y , u ) 
as above, the composite gf: X —» Y comes from the pair (x, u[t/y]). 

We can also reverse this process and get a typed lambda-theory from a cartesian closed category. In fact, Lambek 
and Scott nicely explain how to construct a category of category of cartesian closed categories and a category of 
typed-lambda theories. They construct functors going back and forth between these categories and show these 
functors are inverses up to natural isomorphism. We thus say these categories are ‘equivalent’ H57I . 

4.3 Linear Type Theories 

In his thesis CD , Ambler described how to generalize Lambek’s classic result from cartesian closed categories to 
closed symmetric monoidal categories. To do this, he replaced typed lambda-theories with ‘linear type theories’. 
A linear type theory can be seen as a programming language suitable for both classical and quantum computa¬ 
tion. As we have seen, in a noncartesian category like Hilb, we cannot freely duplicate or delete information. 
So linear type theories must prevent duplication or deletion of data except when it is expressly allowed. 

To achieve this, linear type theories must not allow us to write a program like this: 

(Ax-.X .(x,x)). 

Even a program that ‘squares’ another program, like this: 

(Af:X -o A . (Ax :X . /(ft*)))), 

is not allowed, since it ‘reuses’ the variable /. On the other hand, a program that composes two programs is 
allowed! 

To impose these restrictions, linear type theories treat variables very differently than the typed lambda calcu¬ 
lus. In fact, in a linear type theory, any term will contain a given variable at most once. But linear type theories 
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depart even more dramatically from the typed lambda calculus in another way. They make no use of lambda 
abstraction! Instead, they use ‘combinators’. 

The idea of a combinator is very old: in fact, it predates the lambda calculus. Combinatory logic was born 
in a 1924 paper by Schonfinkel E3, and was rediscovered and extensively developed by Curry m starting 
in 1927. In retrospect, we can see their work as a stripped-down version of the untyped lambda calculus that 
completely avoids the use of variables. Starting from a basic stock of terms called ‘combinators’, the only way 
to build new ones is application: we can apply any term / to any term t and get a term f(t). 

To build a Turing-complete programming language in such an impoverished setup, we need a sufficient 
stock of combinators. Remarkably, it suffices to use three. In fact it is possible to use just one cleverly chosen 
combinator — but this tour de force is not particularly enlightening, so we shall describe a commonly used set 
of three. The first, called /, acts like the identity, since it comes with the rewrite rule: 

1(a) = a 

for every term a. The second, called K , gives a constant function K(a) for each term a. In other words, it comes 
with a rewrite rule saying 

K(a)(b) = a 

for every term b. The third, called S , is the tricky one. It takes three terms, applies the first to the third, and 
applies the result to the second applied to the third: 

S(a)(b)(c) = a(c)(b(c)). 

Later it was seen that the combinator calculus can be embedded in the untyped lambda calculus as follows: 

I = (Ax.x) 

K = (Ax.(Ay.x)) 

S = (Ax.(Ay.(Az.x(z)(y(z))))). 

The rewrite rules for these combinators then follow from rewrite rules in the lambda calculus. More surprisingly, 
any function computable using the lambda calculus can also be computed using just /, K and S ! While we do 
not need this fact to understand linear type theories, we cannot resist sketching the proof, since it is a classic 
example of using combinators to avoid explicit use of lambda abstraction. 

Note that all the variables in the lambda calculus formulas for /, K, and S are bound variables. More gen¬ 
erally, in the lambda calculus we define a combinator to be a term in which all variables are bound variables. 
Two combinators c and d are extensionally equivalent if they give the same result on any input: that is, for any 
term t, we can apply lambda calculus rewrite rules to c(t) and d(t) in a way that leads to the same term. There 
is a process called ‘abstraction elimination’ that takes any combinator in the lambda calculus and produces an 
extensionally equivalent one built from /, K, and S. 

Abstraction elimination works by taking a term t = (Ax.u) with a single lambda abstraction and rewriting 
it into the form (Ax.f(x)), where / has no instances of lambda abstraction. Then we can apply eta reduction, 
which says (Ax.f(x)) = f. This lets us rewrite t as a term / that does not involve lambda abstraction. We shall 
use the notation [[u\\ x to mean ‘any term / satisfing f(x) = u’. 

There are three cases to consider; each case justifies the definition of one combinator: 

1 . t = (Ax.x). We can rewrite this as t = (Ax.I(x)), so t = [[x]], = I. 
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2. t = ( Ax.u ), where u does not depend on x. We can rewrite this as t = (Ax.K(u)(x)), so t = [[u\] x = K(u). 

3. t = ( Ax.uiv )), where u and v may depend on x. We can rewrite this as t = (Ax.(([[u]] x x)([[v]] x x)) or 
t = (dx.S([ML)([[v]L)(x)), SO t = S([ML)([[v]L). 


We can eliminate all use of lambda abstraction from any term by repeatedly using these three rules ‘from the 
inside out’. To see how this works, consider the lambda term t = (Ax.(Ay.y)), which takes two inputs and returns 
the second. Using the rules above we have: 


(Ax.Uy.y)) 


(Tx.(Ty.[[y]] y (y))) 

(Ax.(Ay.I(y))) 

(Ax.I) 

{Ax.mux)) 

{Ax.K{l){x) 

K(I). 


We can check that it works as desired: K(I)(x)(y) = I(y) = y. 

Now let us return to our main theme: linear type theories. Of the three combinators described above, only I 
is suitable for use in an arbitrary closed symmetric monoidal category. The reason is that K deletes data, while 
S duplicates it. We can see this directly from the rewrite rules they satisfy: 

K{a)(b) = a 

S{a){b){c) = a(c)(b(c)). 


Every linear type theory has a set of ‘basic combinators’, which neither duplicate nor delete data. Since 
linear type theories generalize typed lambda-theories, these basic combinators are typed. Ambler writes them 
using notation resembling the notation for morphisms in category theory. 

For example, given two types X and Y in a linear type theory, there is a tensor product type X ® Y. This is 
analogous to a product type in the typed lambda calculus. In particular, given a term s of type X and a term t of 
type T, we can combine them to form a term of type X ® Y, which we now denote as (5 ® t). We reparenthesize 
iterated tensor products using the following basic combinator: 

assoc x , y>z : (X®Y)®Z -> X®(Y®Z). 

This combinator comes with the following rewrite rule: 

assocx,r,z(0 ®t)®u) = (s®(t® u )) 


for all terms s : X, t: Y and u : Z. 

Of course, the basic combinator assoc^Kz is just a mildly disguised version of the associator, familiar from 
category theory. Indeed, all the basic combinators come from natural or dinatural transformations implicit in 
the definition of ‘closed symmetric monoidal category’. In addition to these, any given linear type theory also 
has combinators called ‘function symbols’. These come from the morphisms particular to a given category. 
For example, suppose in some category the tensor product X ® X is actually the cartesian product. Then the 
corresponding linear type theory should have a function symbol 
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which lets us duplicate data of type X, together with function symbols 


p:X®X^X, p':X®X^X 

that project onto the first and second factors. To make sure these work as desired, we can include rewrite rules: 

A(s) = (s®s) 

p(s®t) = s 
p'{s®t) = t. 

So, while duplication and deletion of data are not a ‘built-in feature’ of linear type theories, we can include them 
when desired. 

Using combinators, we could try to design a programming language suitable for closed symmetric monoidal 
categories that completely avoid the use of variables. Ambler follows a different path. He retains variables in 
his formalism, but they play a very different — and much simpler — role than they do in the lambda calculus. 
Their only role is to help decide which terms should count as equivalent. Furthermore, lambda abstraction plays 
no role in linear type theories, so the whole issue of free versus bound variables does not arise! In a sense, all 
variables are free. Moreover, every term contains any given variable at most once. 

After these words of warning, we hope the reader is ready for a more formal treatment of linear type theories. 
A linear type theory has types, combinators, terms, and rewrite rules. The types will correspond to objects in a 
closed symmetric monoidal category, while equivalence classes of combinators will correspond to morphisms. 
Terms and rewrite rules are only used to define the equivalence relation. 

First, the set of types is defined inductively as follows: 

• Basic types: There is an arbitarily chosen set of types called basic types. 

• Product types: Given types X and T, there is a type ( X ® Y). 

• Function types: Given types X and Y, there is a type (X -o Y). 

• Trivial type: There is a type I. 

There may be equations between types, but we require that: 

• If X = X' and Y = Y' then X®Y = X' ® Y'. 

• If X = X' and Y = Y' then X -o Y = X' -oY'. 

Second, a linear type theory has for each pair of types X and Y a set of combinators of the form /: X —» Y. 
These are defined by the following inductive rules: 

• Given types X and Y there is an arbitrarily chosen set of combinators f:X —> T called function symbols. 

• Given types X, Y, and Z we have the following combinators, called basic combinators: 

- id x :X^X 

- assocx,y,z: {X®Y)®Z^>X®{Y®Z) 
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- unassoc z ,y,z: X ® (F ® Z) -> (X ® Y) ® Z 

- braid x ,r:X®Y-> Y®X 

- left x :I®X^>X 

- unleft x :X-> I®X 

- rigl%:X ® / —> X 

- unright x : X —> X ® 1 

- eval X j:X®(X^>Y) -> Y 

• If /: X —> F and g:Y —> Z are combinators, then (g o /): X —» Z is a combinator. 

• If /: X —» F and g: X' —> F' are combinators, then (/ ® g): X ® X' —> Y ® Y' is a combinator. 

• If /: X ® Y —» Z is a combinator, then we can curry / to obtain a combinator /: F —> (X -o Z). 

It will generally cause no confusion if we leave out the subscripts on the basic combinators. For example, we 
may write simply ‘assoc’ instead of assocx,r,z- 

Third, a linear type theory has a set of terms of any given type. As usual, we write t : X to say that t is a 
term of type X. Terms are defined inductively as follows: 

• For each type X there is a countably infinite collection of variables of type X. If x is a variable of type X 
then x \X. 

• There is a term 1 with 1 : 7. 

• If .y : X and t : F, then there is a term (.v <g> t) with (s ® t) : X ® Y, as long as no variable appears in both s 
and t. 

• If /: X —» F is a combinator and t: X then there is a term f(t) with /(f): X. 

Note that any given variable may appear at most once in a term. 

Fourth and finally, a linear type theory has rewrite rules going between terms of the same type. As in our 
treatment of the typed lambda calculus, we only care here about the equivalence relation ~ generated by these 
rewrite rules. This equivalence relation must have all the properties listed below. In what follows, we say a term 
is basic if it contains no combinators. Such a term is just an iterated tensor product of distinct variables, such as 

(z® ((x®y)®w)). 

These are the properties that the equivalence relation ~ must have: 

• If t ~ f then t and t' must be terms of the same type, containing the same variables. 

• The equivalence relation is substitutive: 

- Given terms s ~ s', a variable x of type X, and terms t ~ t' of type X whose variables appear in 
neither 5 nor s', then s[t/x\ ~ s'[f / x]. 
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- Given a basic term t with the same type as a variable x, if none of the variables of t appear in the 
terms s or s', and s[t/x\ ~ s’\t[x\, then s ~ s'. 

• The equivalence relation is extensional: if f : X -o Y, g : X -o Y and eval(f ® /) ~ eval(t <g> g) for all 
basic terms t : X, then f ~ g. 

• We have: 

- id(i) ~ s 

- (8 o f)(s) ~ g(f(s)) 

- (/®*)(s®0-</(«)®*(0) 

- assoc((s ® t) ® u) ~ (5 ® (t ® n)) 

- unassoc(s ® (t 0 u )) ~ ((s ® t) ® u) 

- braidt.y ®t) ~ (t® s) 

- left(l ® s) ~ s 

- unleft(.y) ~ (1 ® s) 

- right(i® 1) ~ s 

- unright(i) ~ (s ® 1) 

- eval(.y ® fit)) ~ f(s ® t) 

Note that terms can have variables appearing anywhere within them. For example, if x, y, z are variables of 
types X, Y and Z, and f\Y®Z^> W is a function symbol, then 

braidtx ® f(y ® z)) 

is a term of type W ® X. However, every term t is equivalent to a term of the form cp(f)(vp(f)), where cp(f) is 
the combinator part of t and vp(t) is a basic term called the variable part of t. For example, the above term is 
equivalent to 

braid o (id ® (/ o (id ® id)))(x <8> (y <8> z)). 

The combinator and variable parts can be computed inductively as follows: 

• If x is a variable of type X, cp(x) = id: X —> X. 

• cp(l) = id: / —> I. 

• For any terms ,v and t, cp(.y ® t) = cp(.s) ® cp(t). 

• For any term ,y : X and any combinator / : X —» Y, cp(/(.y)) = / o cp(.y). 

• If x is a variable of type X, vp(x) = x. 

• vp(l) = 1. 

• For any terms ,y and t, vp(.v ®t) = vp(.s) ® vp (t). 
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• For any term s : X and any combinator f:X —> Y, vp (f(s)) = vp(s). 


Now, suppose that we have a linear type theory. Ambler’s first main result is this: there is a symmetric 
monoidal category where objects are types and morphisms are equivalence classes of combinators. The equiv¬ 
alence relation on combinators is defined as follows: two combinators f,g:X —> Y are equivalent if and only 
if 

M ~ git) 

for some basic term t of type X. In fact. Ambler shows that fit) ~ g(t) for some basic term t : X if and only if 
f(t) ~ gif) for all such basic terms. 

Ambler’s second main result describes how we can build a linear type theory from any closed symmetric 
monoidal category, say C. Suppose C has composition □, tensor product •, internal horn and unit object t. 
We let the basic types of our linear type theory be the objects of C. We take as equations between types those 
generated by: 

• i = I 

• A•B = A® B 

• A -• B = A -o B 

We let the function symbols be all the morphisms of C. We take as our equivalence relation on terms the smallest 
allowed equivalence relation such that: 

• \ A (x) ~ A 

• ( gaf)(x)~g(f(x )) 

• (/ • g)(x ®y)~ (fix) ® g(y)) 

• aA,B,ciix ®y)®z.)~(x®(y® z)) 

• b A ,Bix ® y) ~ (y ® x) 

• l A (l®x)~x 

• r A (x® l)~x 

• ev A ,fi(x ® fiy)) ~ fix ® y) 

Then we define 

• id = 1 

• assoc = a 

• unassoc = cT 1 

• braid = b 

• left = l 
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unleft = / 1 


• right = r 

• unleft = r~ l 

• eval = ev 

• gof = gaf 
and we’re done! 

Ambler also shows that this procedure is the ‘inverse’ of his procedure for turning linear type theories into 
closed symmetric monoidal categories. More precisely, he describes a category of closed symmetric monoidal 
categories (which is well-known), and also a category of linear type theories. He constructs functors going back 
and forth between these, based on the procedures we have sketched, and shows that these functors are inverses 
up to natural isomorphism. So, these categories are ‘equivalent’. 

In this section we have focused on closed symmetric monoidal categories. What about closed categories that 
are just braided monoidal, or merely monoidal? While we have not checked the details, we suspect that pro¬ 
gramming languages suited to these kinds of categories can be obtained from Ambler’s formalism by removing 
various features. To get the braided monoidal case, the obvious guess is to remove Ambler’s rewrite rule for the 
‘braid’ combinator and add two rewrite rules corresponding to the hexagon equations (see Section l2~4l for these). 
To get the monoidal case, the obvious guess is to completely remove the combinator ‘braid’ and all rewrite rules 
involving it. In fact, Jay ED gave a language suitable for closed monoidal categories in 1989; Ambler’s work 
is based on this. 


5 Conclusions 

In this paper we sketched how category theory can serve to clarify the analogies between physics, topology, 
logic and computation. Each field has its own concept of ‘thing’ (object) and ‘process’ (morphism) — and 
these things and processes are organized into categories that share many common features. To keep our task 
manageable, we focused on those features that are present in every closed symmetric monoidal category. Table 
0 an expanded version of the Rosetta Stone, shows some of the analogies we found. 

However, we only scratched the surface! There is much more to say about categories equipped with extra 
structure, and how we can use them to strengthen the ties between physics, topology, logic and computation 
— not to mention what happens when we go from categories to ^-categories. But the real fun starts when we 
exploit these analogies to come up with new ideas and surprising connections. Here is an example. 

In the late 1980s, Witten l99l realized that string theory was deeply connected to a 3d topological quantum 
field theory and thus the theory of knots and tangles ll65l . This led to a huge explosion of work, which was 
ultimately distilled into a beautiful body of results focused on a certain class of compact braided monoidal 
categories called ‘modular tensor categories’ dim 

All this might seem of purely theoretical interest, were it not for the fact that superconducting thin films in 
magnetic fields seem to display an effect — the ‘fractional quantum Hall effect’ — that can be nicely modelled 
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Category Theory 

Physics 

Topology 

Logic 

Computation 

object X 

Hilbert space X 

manifold X 

proposition X 

data type X 

morphism 
f:X Y 

operator 

f:X^Y 

cobordism 
f:X Y 

proof 

f-.X^Y 

program 
f:X —¥ Y 

tensor product 
of objects: 
X®Y 

Hilbert space 
of joint system: 
X®Y 

disjoint union 
of manifolds: 
X®Y 

conjunction 
of propositions: 
X®Y 

product 
of data types: 
X®Y 

tensor product of 
morphisms: f®g 

parallel 

processes: f®g 

disjoint union of 
cobordisms: f®g 

proofs carried out 
in parallel: f ®g 

programs executing 
in parallel: f ® g 

internal horn: 

X -o Y 

Hilbert space of 
‘anti-A" and Y’: 
X*®Y 

disjoint union of 
orientation-reversed 
X and Y:X*®Y 

conditional 

proposition: 

X -o Y 

function type: 

X -°Y 


Table 4: The Rosetta Stone (larger version) 


with the help of such categories 19211931 . In a nutshell, the idea is that excitations of these films can act like 
particles, called ‘anyons’. When two anyons trade places, the result depends on how they go about it: 

x-x 


So, collections of anyons are described by objects in a braided monoidal category! The details depend on 
things like the strength of the magnetic field; the range of possibilities can be worked out with the help of 
modular tensor categories 17611791 . 

So far this is all about physics and topology. Computation entered the game around 2000, when Freedman, 
Kitaev, Larsen and Wang ED showed that certain systems of anyons could function as ‘universal quantum 
computers’. This means that, in principle, arbitrary computations can be carried out by moving anyons around. 
Doing this in practice will be far from easy. However, Microsoft has set up a research unit called Project Q 
attempting to do just this. After all, a working quantum computer could have huge practical consequences. 

But regardless of whether topological quantum computation ever becomes practical, the implications are 
marvelous. A simple diagram like this: 



can now be seen as a quantum process, a tangle, a computation — or an abstract morphism in any braided 
monoidal category! This is just the sort of thing one would hope for in a general science of systems and 
processes. 
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